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METHODS FOR INCREASING THE METHIONINE 
CONTENT OF THE SEEDS OF PLANTS 
CROSS-REFERENCE TO RELATED AP PLICATIONS 
This is a continuation-in-part of Application No. 08/703,829, filed on August 27, 



1996. 



TECHNICAL FIELD 
This invention relates to a nucleic acid fragment encoding a plant methionine 
synthase or methionine synthase. The invention also includes chimeric genes, a first 
encoding a plant methionine synthase (MS) gene, a second encoding a plant cystathionine 
y-synthase (CS) gene, a third encoding feedback-insensitive aspartokinase (AK) or 
bifunctional feedback-insensitive aspartokinase-homoserine dehydrogenase (AK-HDH), 
which is operably linked to a plant chloroplast transit sequence, and a fourth encoding a 
methionine-rich protein, all operably linked to plant seed-specific regulatory sequences. 
Methods for their use to produce increased levels of methionine in the seeds of transformed 
plants are provided. 

BACKGROUND OF THE INVENTION 
Human food and animal feed derived from many grains are deficient in the sulfur 
amino acids, methionine and cysteine, which are required in an animal diet. In corn, the 
sulfur amino acids are the third most limiting amino acids, after lysine and tryptophan, for 
the dietary requirements of many animals. The use of soybean meal, which is rich in lysine 
and tryptophan, to supplement corn in animal feed is limited by the low sulfur amino acid 
content of the legume. Thus, an increase in the sulfur amino acid content of either corn or 
soybean would improve the nutritional quality of the mixtures and reduce the need for 
further supplementation through addition of more expensive methionine. 

Efforts to improve the sulfur amino acid content of crops through plant breeding have 
met with limited success on the laboratory scale and no success on the commercial scale. A 
mutant corn line which had an elevated whole-kernel methionine concentration was isolated 
from corn cells grown in culture by selecting for growth in the presence of inhibitory 
concentrations of lysine plus threonine [Phillips et al., Cereal Chem., (1985), 62, 213-218]. 
However, agronomically-acceptable cultivars have not yet been derived from this line and 
commercialized. Soybean cell lines with increased intracellular concentrations of 
methionine were isolated by selection for growth in the presence of ethionine [Madison and 
Thompson, Plant Cell Reports, (1988), 7, 472-476], but plants were not regenerated from 
these lines. 

The amino acid content of seeds is determined primarily by the storage proteins 
which are synthesized during seed development and which serve as a major nutrient reserve 
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following germination. The quantity of protein in seeds varies from about 10% of the dry 
weight in cereals to 20-40% of the dry weight of legumes. In many seeds the storage 
proteins account for 50% or more of the total protein. Because of their abundance, plant 
seed storage proteins were among the first proteins to be isolated. Only recently, however, 
5 have the amino acid sequences of some of these proteins been determined with the use of 
molecular genetic techniques. These techniques have also provided information about the 
genetic signals that control the seed-specific expression and the intracellular targeting of 
these proteins. 

One genetic engineering approach to increase the sulfur amino acid content of seeds 

10 is to isolate genes coding for proteins that are rich in the sulfur-containing amino acids 

methionine and cysteine, to link the genes to strong seed-specific regulatory sequences, to 
transform the chimeric gene into crops plants and to identify transformants wherein the gene 
is sufficiently-highly expressed to cause an increase in total sulfur amino acid content. 
However, increasing the sulfur amino acid content of seeds by expression of sulfur-rich 

1 5 proteins may be limited by the ability of the plant to synthesize methionine, by the synthesis 
and stability of the methionine-rich protein, and by effects of over-accumulation of the 
methionine-rich protein on the viability of the transgenic seeds. 

An alternative approach would be to increase the production and accumulation of the 
free amino acid, methionine, via genetic engineering technology. However, little guidance is 

20 available on the control of the biosynthesis and accumulation of methionine in plants, 
particularly in the seeds of plants. 

Methionine, along with threonine, lysine and isoleucine, are amino acids derived 
from aspartate. The first step in the pathway is the phosphorylation of aspartate by the 
enzyme aspartokinase (AK), and this enzyme has been found to be an important target for 

25 regulation of the pathway in many organisms. The aspartate family pathway is also believed 
to be regulated at the branch-point reactions. For methionine the reduction of aspartyl 
p-semialdehyde by homoserine dehydrogenase (HDH) may be an important point of control. 
The first committed step to methionine, the production of cystathionine from 
O-phosphohomoserine and cysteine by cystathionine y-synthase (CS), appears to be an 

30 important point of control of flux through the methionine pathway [Giovanelli et al., Plant 
Physiol, (1984), 77, 450-455]. The final step in methionine biosynthesis is catalyzed by the 
enzyme 5-methyltetrahydropteroyltriglutamate-homocysteine methyltransferase, also known 
as methionine synthase (MS). Nucleic acid fragments encoding full-length vitamin-B12 
independent methionine synthases from Madagascar periwinkle {Catharanthus roseus) 

35 [Eichel et al., Eur. J. Biochem. (1995), 230, 1053-1058] Coleus (Solenostemon 

scutellarioides) [Petersen et al., Plant Physiol. (1995), 109, 338], Arabidopsis thaliand 
[Ravanel et al., Proc. Natl. Acad. Sci. USA (1998), 95, 7805-7812], and Mesembryanthemum 
crystallinum [NCBI General Identification No. 1814403], as well as nuceic acid fragments 
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encoding a portion of vitamin-B12 independent methionine synthase from a number of plant 
species such as soybean, rice, and corn have been disclosed previously. 

: ST JMMARY O F THF INVENTION 
The present invention provides plant genes encoding MS, specifically tobacco, corn 
5 and soybean MS genes, additional MS nucleic acid fragments from wheat, as well as 

chimeric MS genes for seed-specific over-expression of the plant enzyme. Combinations of 
these genes with other chimeric genes encoding AK, HDH, CS and methionine-rich seed 
storage protein provide methods to increase the level of methionine in seeds. 

More specifically, the present invention concerns an isolated nucleic acid fragment 
1 0 comprising a nucleotide sequence selected from the group consisting of (a) a nucleotide 

sequence corresponding to any of the nucleotide sequences set forth in SEQ ID NOS:l, 3, 5, 
• 7 or 9 or the complement thereof, or (b) the nucleotide sequence of (a) wherein said sequence 
is degenerate in accordance with the degeneracy of the genetic code. 

In a second embodiment, this invention concerns an isolated nucleic acid fragment 

15 comprising: 

(a) a first nucleic acid fragment comprising a nucleotide sequence selected from 
the group consisting of (a) a nucleotide sequence corresponding to any of the nucleotide 
sequences set forth in SEQ ID NOS:I, 3, 5, 7 or 9 or the complement thereof, or (b) the 
nucleotide sequence of (a) wherein said sequence is degenerate in accordance with the 

20 degeneracy of the genetic code, and 

(b) a second nucleic acid fragment encoding a plant cystathionine y-synthase or 

a functionally equivalent subfragment thereof. 

In a third embodiment, this invention concerns chimeric genes comprising the 
isolated nucleic acid fragments discussed above operably linked to regulatory sequences. 
25 In a fourth embodiment, this invention concerns plants and transformed hosts 

comprising such chimeric genes in their genome and seeds obtained from such plants, 

In a fifth embodiment, this invention concerns a polypeptide comprising all or a 
substantial portion of the amino acid sequence set forth in SEQ ID NOS:2, 4, 6, 8 and 10. 

In a sixth embodiment, this invention concerns a method for increasing methionine 
30 content of the seeds of plants comprising: 

(a) transforming plant cells with the chimeric genes discussed above or the 

nucleic acid fragment discussed above; 

(b) growing fertile mature plants from the untransformed plant cells obtained 
from step (a) under conditions suitable to obtain seeds; and 

3 5 selecting progeny seed of step (b) for those seeds containing increased levels of methionine 
compared to untransformed seeds. 

In a seventh embodiment, this invention concerns a method for producing plant 
methionine synthase comprising the following steps: 

3 
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(a) transforming microbial host cells with a chimeric gene wherein a nucleic acid 
fragment encoding a plant methionine synthase is operably linked to regulatory sequences 
capable of expression in microbial cells; then 

(b) growing the transformed microbial cells obtained from step (a) under 
5 conditions that result in expression of the methionine synthase protein. 

In an eighth embodiment,*** invention concerns a method for evaluating at least one 
compound for its ability to inhibit the activity of a plant methionine synthase, the method 
comprising the steps of: (a) transforming a host cell with a chimeric gene comprising a 
nucleic acid fragment encoding a plant methionine synthase, operably linked to suitable 

1 0 regulatory sequences; (b) growing the transformed host cell under conditions that are suitable 
for expression of the chimeric gene wherein expression of the chimeric gene results in 
production of plant methionine synthase in the transformed host cell; (c) optionally purifying 
the plant methionine synthase expressed by the transformed host cell; (d) treating the plant 
methionine synthase with a compound to be tested; and (e) comparing the activity of the 

1 5 plant methionine synthase that has been treated with a test compound to the activity of an 
untreated plant methionine synthase, thereby selecting compounds with potential for 
inhibitory activity. 

RRTFF DESCR IPTION OF THE 
P>R A WTNGS AND SF-OI IF-NCE DESCRIPT IONS 
20 The invention can be more fully understood from the following detailed description 

and the accompanying drawings and the sequence descriptions which form a part of this 
application. 

Figure 1 shows a comparison of the amino acid sequences of E. coli, yeast, tobacco, 
Catharanthus roseus, corn and soybean MS proteins. 

25 Figure 2 depicts the amino acid sequence alignment between the methionine synthase 

encoded by the corn clone P 0026.ccras26rb (SEQ ID NO:2), contig assembled from soybean 
clones s2.17c08, sdc2c.pk001 .g7, sdp2c.pk001.e21, sdp2c.pk001.n20, sdp2c.pk012.121, 
sd P 2c P k013.dl2, sdp2c.pk042.gl8, sdp3c.pkO01.j3, sdp3c.pk006.n23, sdp3c.pk020.il0, 
S es4d P k0010.fl0, sfll.pkl29.j22, srm.pk0037.h2 and ssm.pk0070.h6 (SEQ IDNO:4), 

30 tobacco clone np.2d06.sk20 (SEQ ID NO:6), wheat clone W lm96. P k0018.clO (SEQ ID 

NO-8) wheat clone wlln. P k0038.e8 (SEQ ID NO:10) and a methionine synthase gene from 
Catharanthus roseus (NCBI General Identifier No. 1 362086, SEQ ID NO:l 1). Amino acids 
which are conserved among all sequences with an amino acid at that position are indicated 
with an asterisk (*). Dashes are used by the program to maximize alignment of the 

35 sequences. 

Table 1 lists the polypeptides that are described herein, the designation of the cDNA 
clones that comprise the nucleic acid fragments encoding polypeptides representing all or a 
substantial portion of these polypeptides, and the corresponding identifier (SEQ ID NO:) as 
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used in the attached Sequence Listing. The sequence descriptions and Sequence Listtng 
attached hereto comply with the rules governing nucleotide and/or amino ac.d sequence 
disclosures in patent applications as set forth in 37 C.F.R. §1.821-1 .825. 



Protein 



SEQ ID NO: 
jTsJiiHpotide) (Amino Acid)_ 



Methionine synthase (corn) 
Methionine synthase (soybean) 



TABLE 1 
Methionine Synthase 

Clone Desig nation^ 
p0026.ccras26rb 

Contig of 
s2.17c08 
sdc2c.pk001.g7 
sdp2c.pk001.e21 
sdp2c.pk001.n20 
sdp2c.pk0 12.121 
sdp2c.pk013.dl2 
sdp2c.pk042.gl8 
sdp3c.pk001.j3 
sdp3c.pk006.n23 
sdp3c.pk020.il 0 
ses4d.pk0010.fl0 
sfll.pkl29.j22 
srm.pk0037.h2 
ssm.pk0070.h6 

np.2d06.sk20 

wlm96.pk00 18x10 

wlln.pk0038.e8 



Methionine synthase (tobacco) 
Methionine synthase (wheat) 
Methionine synthase (wheat) 

SEQ ID NO: 1 1 is the amino acid sequence of a Catharanthus roseus methionine 
synthase NCBI General Identifier No. 1362086. 
, 0 SEQ ID NOS: 12 and 1 3 set forth the sequences of oligonucleotides that were used m 

Example 8 to create a BspH I site at the translation start codon of the tobacco methionme 
synthase gene^ ^ ^ ^ ^ ^ ^ ^ of oligonucleotides that were used in 

Example 8 to create a Kpn I site following the translation stop codon of the tobacco 

1 5 methionine synthase gene. . 

SEQ ID NO: 16 shows the nucleotide sequence of a corn CS cDNA described m 

Example^ ^ ^ ^ ^ ^.^ add sequence of a cor n CS protein derived 

from the nucleotide sequence of SEQ ID NO: 1 6. 
20 SEQ ID NO: 18 shows the nucleotide sequence of a 3639 bp Xba I corn genomic 

DNA fragment encoding two-thirds of the corn CS protein and including 806 bp upstream 
from the protein coding region as described in Example 3. 
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SEO ID NO- 1 9 shows the complete amino acid sequence of the corn CS protein 
deduced from the corn cDNA genomic DNA fragment of SEQ ID NO:7 and the corn 
genomic DNA fragment of SEQ ID NO:8. 

SEQ ID NO S; 20 and 2. show oligonucleotides used ,0 add a translate -a.,on 

codon to the corn CS gene. . 

SEQ ID NO:22 shows the nucleotide sequence of the coding region of the wild t> P e 
E coli lvsC gene, which encodes AKIII, described in Example 5. 

n. cviiLjs^z , a v'TTT derived from the nucleotide 

SEQ ID NO:23 shows the amino acid sequence of AK111 derived irum 

sequence of SEQ ID NO:22. 

SEQ ID NOS:24 and 25 were used in Example 5 to create an Nco I site at the 

translation start codon of the E coli lyj>C gene. 

SEQ ID NOS:26 and 27 were used in Example 6 to screen a corn library for a high 

methionine 10 kD zein gene. „„ 7opnp 
SEQ ID NO:28 shows the nucleotide sequence (2123 bp) of the corn HSZ gene 

Nucleotides 753-755 are the putative translation initiation 

are the putative translation termination codon. Nucleotides 1-752 and 1389-2123 include 
putative 5' and 3' regulatory sequences, respectively. 

SEQ ID NO-29 shows the deduced amino acid sequence of the primary translation 
product of the corn HSZ gene derived from the nucleotide sequence of SEQ ID NO:28. 

SEQ ID NOS:30 and 3 1 were used in Example 7 to modify the HSZ gene by in vitro 
mutagenesis^ ^ ^ ^ ^ ^ ^ ^ ^ ; «^r^ 

only which can be isolated by restriction endonuclease digestion using Nco I (5 -CCATOO) 
to Xba I (5-TCTAGA). Two Nco I sites that were present in the native HSZ coding region 
were eliminated by site-directed mutagenesis, without changing the encoded amino acd 

SeqUen< SEQ ID NO:33 shows the deduced amino acid sequence of the corn HSZ protein 
derived from the nucleotide sequence of SEQ ID NO:32. 

SEQ ID NOS:34 and 35 were used in Example 7 to create a form of the HSZ gene 
30 with alternative unique restriction endonuclease sites. 

SEQ ID NOS:36 and 37 were used in Example 7 to create a gene to code for the 

mature form of HSZ. . f ■ 

SEQ ID N0 38 shows a 579 bp DNA fragment including the coding region of the 
mature corn HSZ protein only, which can be isolated by restriction endonuclease digestion 
35 using Bs P H I (5'-TCATGA) to Xba I (5-TCTAGA). Two Nco I sites « pr. nt * 
the native HSZ coding region were eliminated by site-directed mutagenesis. Th,s was 
accomplished without changing the encoded amino acid sequence. 
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SEQ ID NO:39 shows the deduced amino acid sequence of the corn HSZ protein 
derived from the nucleotide sequence of SEQ ID NO:38. 

SEQ ID NOS:40-45 were used in Example 8 to create a corn chloroplast transit 
sequence and link the sequence to the E coli iysC-M4 gene. 
5 SEQ ID NOS:46-49 were used in Example 9 to create a soybean chloroplast transit 

sequence and link the sequence to the E. coli ]ysC-M4 gene. 

SEQ ID NOS:50-51 were used in Example 9 and 10 as PCR primers to prepare a 
DNA fragment carrying the soybean chloroplast transit sequence. 

SEQ ID NO:52 was used in Example 9 to remove the corn chloroplast transit 
10 sequence from the corn CS gene. 

SEQ ID NOS:53-54 were used in Example 10 as PCR primers to isolate and modify 

the E. coli metL gene. 

SEQ ID NO:55 shows the nucleotide sequence of a partial soybean MS cDNA, 

described in Example 1 . 
1 5 The Sequence Descriptions contain the one letter code for nucleotide sequence 

characters and the three letter codes for amino acids as defined in conformity with the 
IUPAC-IYUB standards described in Nucleic Acids Research 13:3021-3030(1985) and in 
the Biochemical Journal 219 (No. 2):345-373(1984) which are incorporated by reference 
herein. The symbols and format used for nucleotide and amino acid sequence data comply 

20 with the rules set forth in 37 C.F.R. §1.822. 

nFTATT.F.D DESC RIPTION OF THE INV ENTION 
The teachings below describe nucleic acid fragments, chimeric genes and procedures 
useful for increasing the accumulation of methionine in the seeds of transformed plants, as 
compared to levels of methionine in untransformed plants. 

25 In the context of this disclosure, a number of terms shall be utilized. As used herein, 

an "isolated nucleic acid fragment" is a polymer of RNA or DNA that is single- or double- 
stranded, optionally containing synthetic, non-natural or altered nucleotide bases. An 
isolated nucleic acid fragment in the form of a polymer of DNA may be comprised of one or 
more segments of cDNA, genomic DNA or synthetic DNA. 

30 The terms "subfraement that is functionally equivalent" and "functionally equivalent 

subfragment" are used interchangeably herein. These terms refer to a portion or 
subsequence of an isolated nucleic acid fragment in which the ability to alter gene 
expression or produce a certain phenotype is retained whether or not the fragment or 
subfragment encodes an active enzyme. For example, the fragment or subfragment can be 

35 used in the design of chimeric genes to produce the desired phenotype in a transformed 

plant Chimeric genes can be designed for use in co-suppression or antisense by linking a 
nucleic acid fragment or subfragment thereof, whether or not it encodes an active enzyme, ,n 
the appropriate orientation relative to a plant promoter sequence. 
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The terms "substantia.* simi.ar" and "corresponding substantially" as used herein 
refer toxoid fra gm en t s herein changes ,n one or more nuc.eohde a,es s n 0t 

affect ,he ability of the nucleic acid ^^^o^ZZZ ^ZL of 
certain phenotype. These terms also refer to modif.cat.ons 
5 the instant invention such as deletion or insertion of one or more nucleo , des * do 

substantially a.ter the Actional properties of the ^"^^^^ 
the initial, unmodified fragment. It is therefore understood, as those skn le 1 , < 
appreciate, that the invention encompasses more than the specfic 

Moreover the skilled artisan recognizes that substant.ally s.m.lar nucletc acd 

moderately stringent conditions (for example, 0.5 X SSC, 0.1 /o bUb, ou ^ 
^™ hfied herein, or to any portion of the nucleotide sequences .reported here.n 
and which are functionally equivalent to the nucleic acid fragment of the ; mvent.oa 
Preferred substantial similar nucleic acd sequences encompassed by "™ 

,5 those sequences that are 80% identical to the nucle.c acid fragments reported H e n o^ 

which are 80% identical to any portion of the nucleotide sequences ^ 
preferred are nucleic acid fragments which are 90% identical to the nude c acd sequences 
eported herein, or which are 90% tdentica, to any portion of the nuCeot.de seq = 
reported herein. Most preferred are nucleic acid fragments wh.ch are 95 A ^ 

20 nuc.eic acid sequences reported herein, or which are 95% idenfcal to any port.on of the 
nucleotide sequences reported herein. Sequence a.ignmen.s and percent ^mdamy 
calculations may be determined using a variety of comparison methods des.gnd to detect 
homologous sequences tncluding, but not limited* £ Megahgr, program * 

LASARGENE bioinforma.ics ^ h '„d of a gW nen, (Higgins 

25 alignment of the sequences are performed using the Clustal me hod > E ' 

and Sharp (1989) CABIOS. 5:151-1 53) with the default parameters (GAP PENALTY 10, 
GAP LENGTH PENALTY=10). Default parameters for pairwise ahgnments and 
calculation of percent identity of protein sequences using the 

KTUPLE-1, GAP PENALTY=3, WINDOWS and DIAGONALS SAVED .For 
30 acids these parameters are GAP PENALTY=10, GAP LENGTH ^ALTY-IO, 

KTUPLE=2 GAP PENALTY=5, WINDOW=4 and DIAGONALS SAVED=4. 

A "substantia, portion" of an amino acid or nucleotide sequence composes enough of 

the amino acid sequence of a polypeptide or the nucleotide sequence of a gene to afford 

putative identification of that polypeptide or gene, either by manual eva.uat.on of the 
35 sequence by one skilled in the art, or by computer-automated sequence comparison and 

Ztification ustng algorithms such as BLAST (A.tschul, S. F, e, a,., (.993; .. Mo,. B,ot 

2V5-403-410) and Gapped Blast (Altschul, S. F. et al., (1997) Nucleic Acids Res. 

25:3389-3402); see also .nv/M.AST/). Thus, a substantial port.on of an 
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amino acid or nucleotide sequence comprises an amino acid or a nudeot.de sequence that „ 
sufficient to afford putative identification of the protein or gene that the am.no ac.d or 
nucleotide sequence comprises. Amino acid and nucleotide sequences can be evaluated 
either manually by one ski.led in the art, or by using computer-based sequence comparison 
and identification tools that employ algorithms such as BLAST (Basic Local Ahgnmen, 
Search Tool; Altschul etal. (1993)7- Mol. Biol. 275:403-410; see also 
www.ncb..nln,nih.gov/BLAST/). In general, a sequence often or more cont.guous ammo 
acids or thirty or more contiguous nucleotides is necessary in order to putat.vely .dent.fy a 
polypeptide or nucleic acid sequence as homologous to a known protein or gene. Moreover, 
with respect to nucleotide sequences, gene-s P ecif.c oligonucleotide probes compns.ng 30 or 
more contiguous nucleotides may be used in sequence-dependent methods of gene 
identification (e.g., Southern hybridization) and isolation (e.g., in siiu hybnd.zat.on of 
bacterial colonies or bacteriophage plaques). In addition, short oligonucleotides of 12 or 
more nucleotides may be used as amplification primers in PGR in order to obta.n a part.cular 
nuc.eic acid fragment comprising the primers. Accordingly, a "substantial ^ <*\ 
nucleotide sequence comprises a nucleotide sequence that will afford spec.fic ,den.,ficat,on 
and/or isolation of a nucleic acid fragment comprising the sequence. The mstant 
specification teaches amino acid and nucleotide sequences encoding polypeptide, tha 
comprise one or more particular plan, proteins. The skilled artisan, hav.ng the benefit of the 
sequences as reported herein, may now use all or a substantial port.on of the d.sclosed 
sequences for purposes known to those ski.led in this art. Accordingly, the mstan. mvenfon 
comprises the complete sequences as reported in the accompanying Sequence L.st.ng, as 
well as substantial portions of those sequences as defined above. 

It is therefore understood that the invention encompasses more than the specific 
exemplary nucleotide or amino acid sequences and includes functional equivalents thereof. 

For example, i, is well known in the art that antisense suppression and co-suppress.on 
of gene expression may be accomplished using nucleic acid fragments represent.ng less than 
the entire coding region of a gene, and by nucleic acid fragments that do no. share 100/. 
sequence identity with the gene to be suppressed. Moreover, alterations ,n a nuc e.c ac.d 
fragment which result in the production of a chemically equiva.en« ammo acd a. a g.ven s,,e, 
but do no. effect the functiona. properties of the encoded polypeptide, are well known m .he 
art Thus a codon for the amino acid alanine, a hydrophobic amino acd, may be subs..tu.ed 
by a codon encoding another less hydrophobic residue, such as glyc.ne, or a more 
hydrophobic residue, such as valine, leucine, or isoieucine. Similarly, changes wh.ch result 
in substitution of one negatively charged residue for another, such as aspart.c ac.d for 
glutamic acid, or one positively charged residue for another, such as lys.ne for argm.ne, can 
also be expec.ed to produce a functionally equivalent product. Nucleotide changes wh.ch 
result in alteration of the N-terminal and C-.erminal portions of the polypeptide molecule 
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would also not be expected to alter the activity of the polypeptide. Each of the proposed 
modifications is well within the routine skill in the art, as is determination of retention of 
biological activity of the 'encoded products. 

As was mentioned above, substantially similar nucleic acid fragments may also be 
5 characterized by their ability to hybridize. Estimates of such homology are provided by 
either DNA-DNA or DNA-RNA hybridization under conditions of stringency as is well 
understood by those skilled in the art (Hames and Higgins, Eds. (1985) Nucleic Acid 
Hybridisation, IRL Press, Oxford, U.K.). Stringency conditions can be adjusted to screen for 
moderately similar fragments, such as homologous sequences from distantly related 
10 organisms, to highly similar fragments, such as genes that duplicate functional enzymes from 
closely related organisms. Post-hybridization washes determine stringency conditions. One 
set of preferred conditions uses a series of washes starting with 6X SSC, 0.5% SDS at room 
temperature for 15 min, then repeated with 2X SSC, 0.5% SDS at 45°C for 30 min, and then 
repeated twice with 0.2X SSC, 0.5% SDS at 50°C for 30 min. A more preferred set of 
1 5 stringent conditions uses higher temperatures in which the washes are identical to those 
above except for the temperature of the final two 30 min washes in 0.2X SSC, 0.5% SDS 
was increased to 60°C. Another preferred set of highly stringent conditions uses two final 
washes in 0.1 X SSC, 0.1% SDS at 65°C. 

Substantially similar nucleic acid fragments of the instant invention may also be 
20 characterized by the percent similarity of the amino acid sequences that they encode to the 

amino acid sequences disclosed herein, as determined by algorithms commonly employed by 
those skilled in this art. Most preferred are nucleic acid fragments that encode amino acid 
sequences that are at least 87.6% identical to the amino acid sequence set forth in SEQ ID 
NO-2 or at least 87.8% identical to the amino acid sequence set forth in SEQ ID NO:4, or at 
25 least 92 5% identical to the amino acid sequence set forth in SEQ ID NO:6, or at least 86.3% 
identical to the amino acid sequence set forth in SEQ ID NO:8, or at least 80% identical to 
the amino acid sequence set forth in SEQ ID NO: 10. Sequence alignments and percent 
identity calculations were performed using the Megalign program of the LASERGENE 
bioinformatics computing suite (DNASTAR Inc., Madison, WI). Multiple alignment of the 
30 sequences was performed using the Clustal method of alignment (Higgins and Sharp (1989) 
CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, GAP LENGTH 
PENALTY=10). Default parameters for pairwise alignments using the Clustal method were 
KTUPLE 1 GAP PENALTY=3, WINDOWS and DIAGONALS SAVED=5. 

As used herein, "contig" refers to a nucleotide sequence that is assembled from two 
35 or more constituent nucleotide sequences that share common or overlapping regions of 
sequence homology. For example, the nucleotide sequences of two or more nucleic acid 
fragments can be compared and aligned in order to identify common or overlapping 
sequences. Where common or overlapping sequences exist between two or more nucleic 

10 



tS***i! ii—ii -ir»ii *j*L-j 

iLJl ^ CI 



««y tfMl «fl -***~t« •»"'!' '""]?- 

13 % «, 3us jl o iu> ici 



acid fragments, the sequences (and thus their corresponding nucleic acid fragments) can be 
assembled into a single contiguous nucleotide sequence. 

"Codon degeneracy" refers to divergence in the genetic code permitting variation ol 
the nucleotide sequence without effecting the amino acid sequence of an encoded 
5 polypeptide. Accordingly, the instant invention relates to any nucleic acid fragment 

comprising a nucleotide sequence that encodes all or a substantial portion of the amino acid 
sequences set forth herein. The skilled artisan is well aware of the "codon-bias" exhibited 
by a specific host cell in usage of nucleotide codons to specify a given amino acid. 
Therefore, when synthesizing a nucleic acid fragment for improved expression in a host cell, 

10 it is desirable to design the nucleic acid fragment such that its frequency of codon usage 
approaches the frequency of preferred codon usage of the host cell. 

"Synthetic nucleic acid fragments" can be assembled from oligonucleotide building 
blocks that are chemically synthesized using procedures known to those skilled in the art. 
These building blocks are ligated and annealed to form larger nucleic acid fragments which 

1 5 may then be enzymatically assembled to construct the entire desired nucleic acid fragment. 
"Chemically synthesized", as related to nucleic acid fragment, means that the component 
nucleotides were assembled in vitro. Manual chemical synthesis of nucleic ac:d fragments 
may be accomplished using well established procedures, of automated chemical synthesis 
can be performed using one of a number of commercially available machines. Accordingly, 

20 the nucleic acid fragments can be tailored for optimal gene expression based on optimization 
of nucleotide sequence to reflect the codon bias of the host cell. The skilled artisan 
appreciates the likelihood of successful gene expression if codon usage is biased towards 
those codons favored by the host. Determination of preferred codons can be based on a 
survey of genes derived from the host cell where sequence information is available. 

25 "Gene" refers to a nucleic acid fragment that expresses a specific protein, including 

regulatory sequences preceding (5' non-coding sequences) and following (3' non-coding 
sequences) the coding sequence. "Native gene" refers to a gene as found in nature with its 
own regulatory sequences. "Chimeric gene" refers any gene that is not a native gene, 
comprising regulatory and coding sequences that are not found together in nature. 

30 Accordingly, a chimeric gene may comprise regulatory sequences and coding sequences that 
are derived from different sources, or regulatory sequences and coding sequences derived 
from the same source, but arranged in a manner different than that found in nature. 
"Endogenous gene" refers to a native gene in its natural location in the genome of an 
organism. A "foreign" gene refers to a gene not normally found in the host organism, but 

35 that is introduced into the host organism by gene transfer. Foreign genes can comprise 

native genes inserted into a non-native organism, or chimeric genes. A "transgene" is a gene 
that has been introduced into the genome by a transformation procedure. "Coding sequence- 
refers to a DNA sequence that codes for a specific amino acid sequence. "Regulatory 
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sciences" refer to nucleotide sequences located upstream (5' non-coding 
II n o downstream (3' non-coding sequences) of a coding sequence, and wh.ch mfiuence 
2 Inscription, RNA processing or stab.lity, or translation of the assocated codmg 
h e Ice. Regulator, sequences ma y indude, b u, are no, lim.ted ,0, ™~ 
5 leader sequences, introns, and polyadeny.a.ion recogmt.on sequences. Promoter refers to 
a DNA s quence capable of contro.ling the expression of a coding sequence or funct.on 
RNA. The promoter sequence consists of proximal and more dista upstream elements, the 
latter dements often referred to as enhancers. Accordingly, an -enhancer „ a DNA 
sequence which can stimulate promoter activity and may be an innate element of the 
,0 promoter or a heterologous element inserted to enhance the .eve, or t.ssue-spec.ficty of a 
promote, Promoters may be derived in their entirety from a native gene, or be composed of 
different elements derived from different promoters found in nature, or even compnse 
^1 tic DNA segments. I. is understood by those skilled in the ar, that d.fferent promoters 
* c< *e expression of a gene in different tissues or cell types, or a. d.fferent stages of 
, 5 Zlopment, or in response to different envi_tal conditions Promoters wh.ch cause a 
gene to be expressed in most cell types a. most times are commonly referred o 
"constitutive promoters". New promoters of various types useful in plan. ceHs are 
constantly being discovered; numerous examples may be found in the comp.la.ton by 
Olluro and Goldberg, (1989) Bioc h e mi s,ry ofF^s »:l-82. I. is further recogn.zed ha, 
20 since in mos, cases the exact boundaries of reg ul atory sequences have not been complete., 
defined DNA fragments of some variation may have identical promoter acv.ty. 

An "intron ' is an intervening sequence in a gene that does no. encode a port.on of he 
protein sequence. Thus, such sequences are .ranscribed in.o RNA bu. are <^™^ d 
are no. transited. The term is also used for the excised RJs'A sequences. An exon ,s a 

2S ortion of the sequence of a gene that is transcribed and is found ,n .he ma.ure 

RNA derived from the gene, but is no, necessari.y a par, of ,he sequence that encodes the 

fma ' "relation .eader sequence" refers to a nucleot.de sequence located between the 
promoter sequence of a gene and the coding sequence. The translation leader sequence ,s 
30 present in the fully processed mRNA upstream of the translation star, ™= 
Lsla.ion leader sequence may affec, processing of the primary transcnp. to mRNA 
m^A s.abi,„y or .L.a,,on efficiency. Examp.es of ,ans,a.,on leader sequences have 
been described (Turner and Foster (1 995) Molecular Biotechnology 3:225). 

The "3' non-coding sequences" refer ,0 nuc.eo.ide sequences loca.ed downstream of 
35 a coding sequence and .nclude polyadeny.ation recognition sequences and other sequences 
needing regulatory signals capable of affecting mRNA processing or gene express.on The 
ola yla ion st^na, is usually characterized by affecting the addition of polyadenyhc acd 
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tra cts to .he 3' end of the mRNA precursor. The use of different 3' non-coding sequences is 
exemplified by Ingelbrecht et al. (1989) Plant Cell l:67\-6&0. 

"RNA transcript"«fers to the product resulting from RNA polymerase-catalyzed 
transcription of a DNA sequence. When the RNA transcript is a perfect complementary 
copy of the DNA sequence, it is referred to as the primary transcript or ,« may be a RNA 
sequence derived from posttranscriptional processing of the primary *nd .s 

referred to as the mature RNA. "Messenger RNA (mRNA)" refers to the RNA that ,s 
without introns and that can be translated into protein by the cell. cDNA refers to a DNA 
that is complementary to and synthesized from a mRNA template using ^J™ ™™ 
transcriptase. The cDNA can be single-stranded or converted into the double-stranded form 
using the Menow fragment of DNA polymerase .. "Sense" RNA refers to RNA transcr.pt 
that includes the mRNA and so can be translated into protein within a cell or m v„ro. 
"Antisense RNA" refers to an RNA transcript that is complementary to al. or part of a target 
primary transcript or mRNA and that blocks the expression of a target gene (U.S. Patent 
No 5 107 065) The complementarity of an antisense RNA may be with any part of the 
specific gene transcript, i.e., a. the 5' non-coding sequence, 3' non-coding sequence, ^ntrons, 
or the coding sequence. "Func.iona. RNA" refers to antisense RNA, nbozyme RNA, or 
other RNA that may not be translated but ye, has an effect on cellular processes. The terms 
"complement" and "reverse complement" are used interchangeably herein with respect to 
mRNA transcripts, and are meant to define the antisense RNA of the message. 

The term "operably linked" refers to the association of two or more nucle.c acd 
fragments on a single nucleic acid fragment so that the function of one is affected by the 
other For example, a promoter is operably linked with a coding sequence when „ ,s capable 
of affecting the expression of that coding sequence (i.e., that the coding sequence .under 
the transcriptional control of the promoter). Coding sequences can be operably hnked 
regulatory sequences in sense or antisense orientation. 

The term "expression", as used herein, refers to the transcription and stable 
accumulation of sense (mRNA) or antisense RNA derived from the nucleic acd fragment of 
the invention. Expression may also refer to translation of mRNA into a polypept.de. 
"Antisense inhibition" refers to the production of antisense RNA transcripts capable , of 
suppressing the expression of the target protein. "Overexpression" refers to the production 
of a gene product in transgenic organisms that exceeds levels of production ,n nonnal or 
non-transformed organisms. "Co-suppression" refers to the production of sense RN A 
transcripts capable of suppressing the expression of identical or substantially s,m,.ar foreign 
or endogenous genes (U.S. Patent No. 5,231,020, incorporated herein by reference). 

"Altered levels" refers to the production of gene produc.(s) in transgenic organisms 
in amounts or proportions that differ from that of normal or non-transformed organisms. 
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"Altered expression" refers to the production of gene product(s) in transgenic 
organisms in amounts or proportions that differ significantly from that actmty ,n 
comparabie tissue (organ-and of developmental type) from wild-type orgamsms. 

"Mature" protein refers to a post-translationally processed polypeptide; ,.e„ one from 
5 which any pre- or propeptides present in the primary translation product have been removed. 
"Precursor" protein refers to the pnmary product of translation of mRNA; ,.e ; , w,th pre- and 
propeptides still present. Pre- and propeptides may be but are not limited to mtracellular 

localization signals. . 
A "chloroplast transit peptide" is an amino acid sequence which is translated m 

10 conjunction with a protein and directs the protein to the chloroplast or other plastid types 
present in the cell in which the protein is made. "Chloroplast transit sequence refers to a 
nucleotide sequence that encodes a chloroplast transit peptide. A "signal pept,de is an 
amino acid sequence which is translated in conjunction with a protein and directs the pro ein 
to the secretory system (Chrispeels (1991) Ann. Rev. Plant Phys. Plant Mol Btol. 42.21-53). 

15 If the protein is to be directed to a vacuole, a vacuolar targeting signal (supra) can further be 
added or if to the endoplasmic reticulum, an endoplasmic reticulum retention signal (supra) 
may be added. If the protein is to be directed to the nucleus, any signal peptide present 
should be removed and instead a nuclear localization signal included (Raikhel (1992) Plant 
Phys. 700:1627-1632). 

20 "End-product inhibition" or "feedback inhibition" refers to a biological regulatory 

mechanism wherein the catalytic activity of an enzyme in a biosynthetic pathway is 
reversibly reduced by binding to one or more of the end-products of the pathway when the 
concentration of the end-product(s) reaches a sufficiently high level, thus slowing the 
biosynthetic process and preventing over-accumulation of the end-product. 

25 "Transformation" refers to the transfer of a nucleic acid fragment into the genome of 

a host organism, resulting in genetically stable inheritance. Host organisms containing the 
transformed nucleic acid fragments are referred to as "transgenic" organisms. The preferred 
method of cell transformation of rice, corn and other monocots is the use of partic e- 
accelerated or "gene gun" transformation technology (Klein et al., (1 987) Nature (London) 

30 527-70-73- U.S. Patent No. 4,945,050), or an Agrobacterium-mediated method using an 
appropriate Ti plasmid containing the transgene (Ishida Y. et al., 1996, Nature Biotech. 

14:745-750). . 

Standard recombinant DNA and molecular cloning techniques used herein are well 
known in the art and are described more fully in Sambrook, J., Fritsch, E.F. and Mamata. T. 
35 Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold 
Spring Harbor, 1 989 (hereinafter "Sambrook"). 

"PCR" or "Polymerase Chain Reaction" is a technique for the synthesis of large 
quantities of specific DNA segments, consists of a series of repetitive cycles (Perkm Elmer 
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Cetus Instruments, Norwalk, CT). Typically, the double stranded DNA is hea, denatured, 
the two primers complementary to the 3' boundaries of the target segment are annealed a. 
low temperature and then extended at an intermediate temperature. One set of these three 
consecutive steps is referred to as a cycle. 
5 An "expression construct-as used herein comprises any of the isolated nucleic acid 

fragments of the invention used either alone or in combination with each other as discussed 
herein and further may be used in conjunction with a vector or a subfragment thereof If a 
vector is used then the choice of vector is dependent upon the method that will be used to 
transform host plants as is well known to those skilled in the art. For example, a plasmid 
1 0 vector can be used. The skilled artisan is well aware of the genetic elements that must be 
present on the vector in order to successfully transform, select and propagate host cells 
comprising any of the isolated nucleic acid fragments of the invention. The skilled artisan 
will also recognize that different independent transformation events will result in different 

/t ♦ „i nae<^ Fund J 4 241 1-2418; De Almeida et 
levels and patterns of expression (Jones et al., (1985) tMBUJ. > 

15 al (1989) Mo!. Gen. Genetics 275:78-86), and thus that multiple events must be screened in 
order to obtain lines displaying the desired expression level and pattern. Such screening 
may be accomplished by Southern analysis of DNA, Northern analysis of mRNA 
expression, Western analysis of protein expression, or phenotypic analysis. The terms 
"expression construct" and "recombinant expression construct" are used interchangeably 

20 herein. . . 

The invention concerns an isolated nucleic acid fragment comprising a nucleotide 
sequence selected from the group consisting of (a) a nucleotide sequence corresponding to 
any of the nucleotide sequences set forth in SEQ ID NOS: 1, 3, 5, 7 or 9 or the complement 
thereof, or (b) the nucleotide sequence of (a) wherein said sequence is degenerate in 
25 accordance with the degeneracy of the genetic code. 
Isolation of a plant MS gene 

In order to increase the accumulation of free methionine in the seeds of plants via 
genetic engineering, a gene encoding 5-methyltetrahydropteroyl-triglutamate-homocysteine 
methyltransferase, also known as methionine synthase (MS), was isolated from several crop 
30 plants. MS catalyzes the final reaction in the biosynthesis of methionine. 

It is shown that plant MS genes can be isolated and identified by comparison of 
random plant cDNA sequences to the GenBank database using the BLAST algorithms well 
known to those skilled in the art. The use of this approach to isolate tobacco, soybean and 
corn MS cDNA genes is presented in detail in Example 1 . The nucleotide sequence of a corn 
35 MS cDNA is provided in SEQ ID NO:l, the nucleotide sequence of a soybean MS cDNA 
assembled from a contig is provided in SEQ ID NO:3, the nucleotide sequence of a tobacco 
MS cDNA is provided in SEQ ID NO:5, and the partial nucleotide sequence of wheat MS 
cDNAs is provided in SEQ ID NOS:7 and 9. MS genes from other plants can now be 
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r j „nNA scauences to the plant MS sequences provided 
identified by comparison of random cDNA sequences . > P ^ 
herein. Alternatively, other plant MS genes, either as cDNAs or genom 
isolated directly by thither the tobacco, ^Itlpioying 
fragment as a DNA hybridization probe to screen libraries Irom any 

methodology we., known to those ca „ be used 10 crea .e chimeric genes 

Nucleic acid fragments carrying plant MS genes can o 
which are useful for over-expressing MS in plant cells and in heterologous host cells. When 

c!l,s ,. is particularly use*, to use the MS gene to increase the methionine content in the 

cells of the seeds of plants. , 

1, may also be desirable to reduce or eliminate expression of the MS gene plan 
for some applications. In order to accomp.ish this, a chimeric gene designed for 
^suppression of MS can be constructed by linking the MS gene or gene fragment to a plan. 
ZZIZ^ (See U.S. Patent No. 5,231,020 for methodology to block plan, gene 
^LTX^on.) Aitematively, a chimeric gene designed to express antisense 
Z for a,, or part "the MS gene can be constructed by - 
fragment in reverse orientation to a plant promoter sequences. (See U.S. Paten 
No 5 107 065 for methodology to block plant gene expresston v,a anfs nse RNA.) 
the compression or antisense chimeric gene could be introduced into plants via 
trl— on. Transforms wherein expression of the endogenous MS gene ,s reduced or 

in heterologous host ce„s, particular, in the ce.ls of 
microbials, can I used to prepare antibodies to the protein by — 
those skilled in the art. The antibodies are useful for detecting plant MS protem in s tu n 
cdls or in vi. o in cell extracts. Preferred heterologous host cells for production of plant MS 

1 n "e m 1 losts. Microbial expression systems and expression vectors containing 
ory sequences tha, direct high level expression of foreign proteins — » 

those skilled in the art. Any of these could be used to construct chimeric genes for 
"„ of P.an, MS. These chimeric genes cou.d then be introduced mto appropriate 

microorganisms via transformation to provide high level expression of plant MS. An 

^pTof a vector for high level expression of plan, MS in a bacteria, host ,s provided 

<EX!m t Mother aspect, this invention concerns a polypeptide comprising all or a 
substantia, portion of the amino acid sequence set forth in SEQ ID NOSS. 4£ 8 £ ^ 

Additionally, the P ,an, methionine synthase protein can be used as ata get to design 
and/or identify inhibitors of the enzyme that may be use&, as herbicides. Th. ■ ^esiraWe 
because methionine synthase ca.a,yzes a necessary step in the essential methionine 
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biosynthetic pathway. Since methionine is metabolized to S-adenosyl-methiomne, which is 
used in many important cellular processes, inhibition of methionine biosynthesis results m 
pleiotropic effects, whichpotentiate herbicidal activity. Accordingly, inhibition of 
methionine synthase activity could lead to inhibition of plant growth. Plant methionine 
5 synthase differs sufficiently from animal methionine synthase in amino acid sequence and 
action mechanism (Eichel et al. (1995) Eur J Biochem 250:1053-1058; Yamada et al. (1998) 
Biosci Biotechnol Biochem 62:2155-2160) that some inhibitors of plant methionine synthase 
are likely to be plant-specific. Thus, the instant polypeptides could be appropriate for new 
herbicide discovery and design. 
10 In still another embodiment, this invention concerns a method for evaluating at least 

one compound for its ability to inhibit the activity of a plant methionine synthase, the 

method comprising the steps of: 

(a) transforming a host cell with a chimeric gene comprising a nucleic acid 
fragment encoding a plant methionine synthase, operably linked to suitable 

\ 5 regulatory sequences; 

(b) growing the transformed host cell under conditions that are suitable for 
expression of the chimeric gene wherein expression of the chimeric gene 
results in production of the plant methionine synthase encoded by the 
operably linked nucleic acid fragment in the transformed host cell; 

2 0 (c) optionally purifying the plant methionine synthase expressed by the 

transformed host cell; 

(d) treating the plant methionine synthase with a compound to be tested; and 

(e) comparing the activity of the plant methionine synthase that has been treated 
with a test compound to the activity of an untreated plant methionine 

25 synthase, 

thereby selecting compounds with potential for inhibitory activity. 

Another aspect of the invention concerns an isolated nucleic acid fragment 

comprising: 

(a) a first nucleic acid fragment comprising a nucleotide sequence selected from 
30 the group consisting of (a) a nucleotide sequence corresponding to any of the nucleotide 

sequences set forth in SEQ ID NOS: 1 , 3, 5, 7 or 9 or the complement thereof, or (b) the 
nucleotide sequence of (a) wherein said sequence is degenerate in accordance with the 

degeneracy of the genetic code, and 

(b) a second nucleic acid fragment encoding a plant cystathionine y-synthase or a 

3 5 functionally equivalent subfragment thereof. 
Isolation o f a plant CS gene 

Cystathionine y-synthase (CS) catalyzes the first reaction wherein cellular metabolites 
are committed to the synthesis of methionine and has been implicated to play a key role m 
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the regulation of methionine biosynthesis. Regulation is not achieved through feedback 
inhibition of CS by any of the pathway end-products [Thompson et al. (1982) Plant Physiol 
59-1077-1083], however/ Thus, over-expression of CS is expected to increase flux through 
the methionine branch of the biosynthetic pathway, even when high levels of meth.onine are 
5 accumulated. 

In order to increase the accumulation of free methionine in the seeds of plants it may 
desirable to increase the expression of cystathionine y-synthase (CS) in concert with MS. 
Therefore a gene encoding plant cystathionine y-synthase (CS) is provided. Also prodded 
herein is a unique nucleic acid fragment containing a plant MS gene linked to a plant CS 

10 gene. . 

A plant CS gene was isolated by complementation of an E coli host strain bearing a 

metB mutation. Such a strain requires methionine for growth due to inactivation of the 

E coli gene that encodes CS. Functional expression of the plant CS gene allowed the strain 

to grow in the absence of methionine. The use of this approach to isolate a corn CS cDNA 

15 gene is presented in detail in Example 3. The nucleotide sequence of a corn CS cDNA is 

provided in SEQ ID NO: 1 6. CS genes from other plants could be similarly isolated by 

functional complementation of an E coli metB mutation. Alternatively, other plant CS 

genes, either as cDNAs or genomic DNAs, could be isolated by using the corn CS gene as a 

DNA hybridization probe. 

This invention also concerns a method for increasing methionine content of the seeds 

of plants comprising: 

(a) transforming plant cells with a chimeric gene comprising an isolated nucleic 
acid fragment comprising a nucleotide sequence selected from the group consisting of (a) a 
nucleotide sequence corresponding to any of the nucleotide sequences set forth in SEQ ID 
NOS-1 3 5 7 or 9 or the complement thereof, or (b) the nucleotide sequence of (a) wherein 

said sequence is degenerate in accordance with the degeneracy of the genetic code operably 
linked to a regulatory sequence, or a nucleic acid fragment comprising (1) a chimeric gene 
comprising an isolated nucleic acid fragment comprising a nucleotide sequence selected 
from the group consisting of (a) a nucleotide sequence corresponding to any of the 
30 nucleotide sequences set forth in SEQ ID NOS:l, 3, 5, 7 or 9 or the complement thereof, or 
(b) the nucleotide sequence of (a) wherein said sequence is degenerate in accordance with the 
degeneracy of the genetic code operably linked to a regulatory sequence and (2) a second 
chimeric gene comprising a nucleic acid fragment encoding a plant cystathionine y-synthase 
or a functionally equivalent subfragment thereof or a complement thereof operably linked to 

35 a regulatory sequence; 

(b) growing fertile mature plants from the untransformed plant cells obtained 

from step (a) under conditions suitable to obtain seeds; and 
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se.ec.ing progeny seed of step (b) for those seeds containing inereased .eve.s of 
methionine compared to untransformed seeds. 

^^of « plants compnsing in the, genome such chimenc genes and seeds 

obtained from such plants. producing plant methionine 

In still another aspect this invention conerns a method tor proauc g p 

synthase comprising: . . ; cn u t rH nucleic 

(a) transforming host cells with a ehimerie gene eompnsmg an tsolated nude c 

acid fragment comprising a nucleotide sequence selected from the 

nucleottde sequence corresponding to any of the nucleotide sequences set M n EC D 
NOS1 3 5, 7 or 9 or the complement thereof, or (b) the nucleofde sequence of (a) where.n 
^sequence is degenerate in accordance with the degeneracy of the genettc ^ ; oper * y 
linked to a regulatory sequence, or a nuc.eic acid fragment eompnsmg (1) a ch.menc gene 
Uprising an isolated nucleic acid fragment comprising a nucleotide 
from the group consisting of (a) a nucleotide sequence corresponding to «^ 
nucleotide sequences set forth in SEQ ID NOS: 1. 3, 5, 7 or 9 or the complement thereof , o 
Z Z nuclide sequence of (a, wherein said sequence is degenerate in accor ancewnh the 
degeneracy of the genetic code operably linked to a regulatory sequence and (2) a second 

ZIc gene comprising a nucleic acid fragment encoding a 
or a functionally equivalent subfragmen, thereof or a complement thereof operably hnked 

a re ^atory^quence^ ^ ^ ^ obtained from step (a) under 

conditions that result in expression of a plant methionine synthase protem 

As is clear from the discussion above, the host cell can be a plant cell or a mtcrob.al 

cell. 

Isolation of A K Genes . 

Over-expression of feedback-insensitive AK increases flux through the enure 
pathway of aspartate-derived amino acids even in the presence of high concentrations of the 
CZ end-Prooucts ,ysine, threonine and methionine. This increased flux prov. es more 
ubstrate for CS and MS and increases the potential for methionine ^"^^ 
Provided herein is a unique nucleic acid fragment contammg a plant MS gene hnked 
,0 a plant CS gene and a gene for AK, which is insensitive to feedback tnhtb.Uon by e d- 

roducrs of th 8 e biosynthetic pathway. Also provided is a unique nucletc a«me„, 
containing a plant MS gene linked to a plant CS gene and a gene for AK-HDH, both 

"f which are Insensitive to feedback-inhibition by end-products of the btosy nthette 
pathway. Over-expression of feedback-insensitive AK-HDH d.rects the ,nc eas d flux 
through the methionine-threonine branch of the aspartate-derived ammo actd pathway, 
further increasing the potential for methionine biosynthesis. 
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A number of AK and AK-HDH genes have been isolated . and 

include ,he thrA gene of* coH (Katinka e. a. (^^^^.^^et 25 3028-303 , ,, 

57331 the metL gene of E. coli Zakin et al. (1983) J Biol. cnem. z 
he )2 o £ c^Cassan e, a,. (1986),. Biol Cnen,. 2oV:,052-,057] and the HOM1 

5 ^ffjZlLae [Rafalski e, ah (.988)7. *««>/. 263:2,46-2,51]. The thrA gene 

f £ c » encodes a Afunctional protein, AKI-HDHI. The AK activity of this enzyme ,s 
tL i d by threonine. The rr^tL gene of*, coli also encodes a afunctional protein 
AKII-HDHII, and the AK activity of this enzyme is insensitive to a, pathway end-products. 
The E. coli taE gene encodes AKIII. which is sensitive to .ysme inh.b.fon. The HOM3 

in eene of veast encodes an AK which is sensitive to threonine. 

Abdicated above AK genes are readily availab.e to one skilled in the art for use ,n 
fce presC t invention. A preferred Cass of AK genes encoding ^back-insensdive enzymes 
are derived from the E. coli lysC gene. Procedures useful for the .solatton of the w d type 
E coli fysC gene and lysine-insensitive mutations are presented in deta.l m Example 5. 

,5 The sequences of three mutant lysC genes that encoded lysine-insensitive 

aspartokTnaselach differed from tbe wi,d type sequence by a single .nucleotide resu tmg ,n a 
single amino acid substitution in the protein. Other mutations could be *^ « **» 
target sites m yto by site-directed mutagenesis, using methods known to those ski.led ,n the 
art Sueh mutations would be expected to result in a lysine-insensitive enzyme. 

20 Furthermore, the in yiys method described in Example 5 could be used ,0 eas,l> .so at and 
eharacterize as many additional mutant lysC genes encoding lysine-insensittve AKIII 

""'^ Another preferred class of AK genes are those encoding bi-func,ional enzymes 
AK-HDH, wherein both catalytic activities are insensitive to end-produc, mh.bition. A 
25 referred AK-HDH enzyme is E. coli AKII-HDHI. encoded by the mejL gene. As ind eat d 
above this gene has been isolated and sequenced previously. Thus, it can be eas, y obtained 
TZl the present invention by the same method used to obtain the ly^ gene described ,n 
Example 5. Alternatively, the gene can be isolated from £ coli genomic DNA via T>CR 
using oligonucleotide primers designed based on the pub.ished DNA sequence as described 

30 in Example 9. . . . 

In addition to these genes, several plant genes encoding lysme-.nsensi.ive AK re 
known. In barley, lysine plus threonine-resistant mutants bearing mutations m two un inked 
genes that result in two different lysine-insensitive AK isoenzymes hav< . been described 
[Bright et al., Nature. (1982), 299. 278-279, Rognes et al., Planla, (1983) 757 32-38 

35 Lda e, a,., Plan, Pnsiol., (1984), 76, 442-446,. In com, a lysine plus threomne-reststan, 
cell line had AK activity that was less sensitive to lysine inhibition than its parent line 
[Hibberd et al., Planta. (1980), 148, 183-187). A subsequently isolated lysine plus 
threonine-resistant eom mutant is altered at a different genetic locus and also produces 
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lysine-insensitive AK [Diedrick et al., Theo, Appl. Genet., (1990), 79, 209-215, Dotson 
et al Plania (1990), 752, 546-552]. In tobacco there are two AK enzymes in leaves, one 
lysine-sensitive and one threonine-sensitive. A lysine plus threonine-resistant tobacco 
mutant that expressed completely lysine-insensitive AK has been described [Frankard et al., 
Theor Appl. Genet., (1991), 82, 273-282]. These plant mutants could serve as sources of 
genes encoding lysine-insensitive AK and used, based on the teachings herein, to increase 
the accumulation of methionine in the seeds of transformed plants. 

A partial amino acid sequence of AK from carrot has been reported [Wilson et al. 
Plant Physiol., (1991), P7,1323-1328]. Using this information a set of degenerate DNA 
oligonucleotides could be designed, synthesized and used as hybridization probes to permit 
the isolation of the carrot AK gene. Recently the carrot AK gene has been isolated and its 
nucleotide sequence has been determined [Matthews et al., U.S.S.N, (1991), 07, 746,705]. 
This gene was used as a heterologous hybridization probe to isolate the Arabidopsis thahana 
AK-HDH gene [Ghislain et al., Plant Mol. Biol, (1994), 24, 835-851 ], and thus can be used 
as a heterologous hybridization probe to isolate the plant genes encoding lysine-insensitive 
AK or AK-HDH described above. 
Methionine-Rich Storage Pr otein Genes 

It may be useful for certain applications to incorporate the excess free methionine 
produced via deregulation of the biosynthetic pathway into a storage protein. This can help 
20 to prevent metabolism of the excess free methionine into such products as S-adenosyl- 

methionine, which may be undesirable. The storage protein chosen should contain higher 
levels of methionine than average proteins. Ideally, these methionine-rich storage proteins 
should contain at least 15% methionine by weight. 

A number of methionine-rich plant seed storage proteins have been identified and 
25 their corresponding genes have been isolated. A gene in corn for a 1 5 kD zein protein 

containing about 15% methionine by weight [Pedersen et al., J. Biol. Chem., (1986), 261, 
6279-6284], a gene for a 10 kD zein protein containing about 30% methionine by weight 
[Kirihara et'al., Mol. Gen. Genet., (1988), 21, 477-484; Kirihara et al., Gene, (1988), 71, 
359-370] have been isolated. A gene from Brazil nut for a seed 2S albumin containing about 
30 24% methionine by weight has been isolated [Altenbach et al., Plant Mol. Biol., (1987), 8, 
239-250]. From rice a gene coding for a 10 kD seed prolamin containing about 25% 
methionine by weight has been isolated [Masumura et al., Plant Mol. Biol., (1989), 12, 
123-130]. A preferred gene, which encodes the most methionine-rich natural storage protein 
known is an 1 8 kD zein protein designated high sulfur zein (HSZ) containing about 37% 
35 methionine by weight that has recently been isolated [World Patent Publication No. 

WO 92/14822, see Example 6]. Thus, methionine-rich storage protein genes are readily 
available to one skilled in the art. 
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Below is provided a discussion concerning the construction of chimeric genes for 
high-level seed-specific expression of methionine-rich storage protein genes. In addition, 
there have been several reports on the expression of methionine-rich seed storage protein 
genes in transgenic plants. The high-methionine 2S albumin from Brazil nut has been 
5 expressed in the seeds of transformed tobacco under the control of the regulatory sequences 
from a bean phaseolin storage protein gene. The protein was efficiently processed from a 
17 kD precursor to the 9 kD and 3 kD subunits of the mature native protein. The 
accumulation of the methionine-rich protein in the tobacco seeds resulted in an up to 30% 
increase in the level of methionine in the seeds [Altenbach et al., Plant Mol. Biol, (1989), 

10 73 51 3-522] This methionine-rich storage protein has also been efficiently expressed in 

Canola seeds [Altenbach et al., Plant Mol. Biol., (1992), 18, 235-245.] In another case, high- 
level seed-specific expression of the 15 kD methionine-rich zein, under the control of the 
regulatory sequences from a bean phaseolin storage protein gene, was found in transformed 
tobacco; the signal sequence of the monocot precursor was also correctly processed in these 

15 transformed plants [Hoffman et al., EMBO J., (1987), 6, 3213-3221]. As another example, 
the 18 kD zein protein containing 37% methionine has been expressed in tobacco and 
soybean seeds [World Patent Publication No. WO 92/14822]. 

Construction of Chimeric Genes for Expression of 
MS. CS. AK. AK-HDH and methionine- r ir.h storage proteins in the Seeds of Plants 

20 In order to increase biosynthesis of methionine in seeds, suitable regulatory 

sequences are provided to create chimeric genes for high level seed-specific expression of 
the MS, CS, AK or AK-HDH and methionine-rich storage proteins. The replacement of the 
native regulatory sequences accomplishes three things: 1) any methionine-concentration- 
dependent regulatory sequences are removed, permitting biosynthesis to continue in the 

25 presence of high levels of free methionine, 2) any pleiotropic effects that the accumulation of 
excess free methionine might have on the vegetative growth of plants is prevented because 
the chimeric gene(s) is not expressed in vegetative tissue of the transformed plants 3) high 
level expression of the enzyme(s) and storage protein(s) is obtained in the seeds. 

The expression of foreign genes in plants is well-established [De Blaere et al., Meth. 

30 Enzymol , (1987), 143, 277-291]. Proper level of expression of MS, CS, AK or AK-HDH 

and methionine-rich storage protein mRNAs may require the use of different chimeric genes 
utilizing different promoters. Such chimeric genes can be transferred into host plants either 
together in a single expression vector or sequentially using more than one vector. Preferred 
among the higher plants and the seeds derived from them are soybean, rapeseed {Brasswa 

35 napus, B. campestris), sunflower (Helianthus annus), cotton (Gossypium hirsutum), corn, 
tobacco {Nicotiana Tubacum), alfalfa (Medicago sativa), wheat (Triticum sp.), barley 
(Hordeum vulgare), oats (Avena sativa, L), sorghum (Sorghum bicolor), rice (Oryza sanva), 
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and forage grasses. Expression in plants will use regulatory sequences functional in such 
plants. 

The origin of the promoter chosen to drive the expression of the coding sequence is. 
not critical as long as it has sufficient transcriptional activity to accomplish the invention by 
5 expressing translatable mRNA for MS, CS, AK or AK-HDH and methionine-rich storage 
protein genes in the desired host tissue. 

Preferred promoters are those that allow expression of the protein specifically in 
seeds This may be especially useful, since seeds are the primary source of vegetable amino 
acids and also since seed-specific expression will avoid any potential deleterious effect in 

1 0 non-seed organs. Examples of seed-specific promoters include, but are not limited to, the 
promoters of seed storage proteins. The seed storage proteins are strictly regulated, being 
expressed almost exclusively in seeds in a highly organ-specific and stage-specific manner 
[Higgins et al., Ann. Rev. Plant Physiol., (1984), 35, 191-221 ; Goldberg et al., Cell. (1 989), 
56, 149-160; Thompson et al., BioEssays, (1989), 10, 108-1 13]. Moreover, different seed 

1 5 storage proteins may be expressed at different stages of seed development. 

There are currently numerous examples for seed-specific expression of seed storage 
protein genes in transgenic dicotyledonous plants. These include genes from dicotyledonous 
plants for bean P -phaseolin [Sengupta-Goplalan et al., Proc. Natl Acad. Sci. USA, (1985), 
82 3320-3324; Hoffman et al., Plant Mol. Biol, (1988), 11, 717-729], bean lectin [Voelker 

20 et al., EMBOJ., (1987), 6, 3571-3577], soybean lectin [Okamuro et al., Proc. Natl. Acad. 

Sci. USA, (1986), 83, 8240-8244], soybean kunitz trypsin inhibitor [Perez-Grau et al., Plant 
Cell (1989) 1 095-1 109], soybean p-conglycinin [Beachy et al., EMBOJ., (1985), 4, 
3047-3053" Barker et al., Proc. Natl. Acad. Sci. USA, (1988), 85, 458-462; Chen et al., 
EMBOJ., (1988), 7,297-302; Chen et al., Dev. Genet., (1989), 10, 1 12-122; Naito et al., 

25 Plant Mol. Biol., (19**), IU 109-123], pea vicilin [Higgins et al., Plant Mol. Biol., (1988), 
11 683-695], pea convicilin [Newbigin et al., Planta, (1990), 180, 461], pea legumin 
[Shirsat et al., Mol. Gen. Genetics, (1989), 215, 326]; rapeseed napin [Radke et al., Theor. 
Appl Genet (1988), 75, 685-694] as well as genes from monocotyledonous plants such as 
for maize 15 kD zein [Hoffman et al., EMBOJ., (1987), 6, 3213-3221 ; Schernthaner et al., 

30 EMBOJ (1988), 7, 1249-1253; Williamson et al., Plant Physiol., (1988), 88, 1002-1007], 
barley P -hordein [Marris et al., Plant Mol. Biol., (1988), 10, 359-366] and wheat glutenin 
[Colot et al EMBOJ., (1987), 6, 3559-3564]. Moreover, promoters of seed-specific genes, 
operably linked to heterologous coding sequences in chimeric gene constructs, also maintain 
their temporal and spatial expression pattern in transgenic plants. Such examples include 

35 linking either the Phaseolin or Arabidopsis 2S albumin promoters to the Brazil nut 2S 
albumin coding sequence and expressing such combinations in tobacco, Arabidopsis, or 
Brassica napus [Altenbach et al., Plant Mol Biol, (1989), 13, 513-522; Altenbach et al., 
Plant Mol. Biol, (1992), 18, 235-245; De Clercq et al., Plant Physiol, (1990), 94, 970-979], 
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bean lectin and bean p-phaseolin promoters to express luciferase [Riggs ct al., Plant Sci., 
(1989), 63, 41-57), and wheat glutenin promoters to express chloramphenicol acetyl 
transferase [Colot et al, EMBOJ., (1987), 6, 3559-3564]. 

Of particular use in the expression of the nucleic acid fragment of the invent.on will 
5 be the heterologous promoters from several extensively-characterized soybean seed storage 
protein genes such as those for the Kunitz trypsin inhibitor [Jofuku et al., Plant Cell, (1989), 
1, 1079-1093; Perez-Grau et al., Plant Cell, (1989), 1, 1095-1 109], glycinin [Nielson et al., 
Plant Cell (1989), 1, 313-328], p-conglycinin [Harada et al., Plant Cell, (1989), 1, 415-425]. 
Promoters of genes for a'- and P-subunits of soybean P -conglycinin storage protein will be 

1 0 particularly useful in expressing the CS, AK and AK-HDH mRNAs in the cotyledons at mid- 
to late-stages of soybean seed development [Beachy et al., EMBOJ., (1985), 4, 3047-3053; 
Barker et al., Proc. Natl. Acad. Sci. USA, (1988), 85, 458-462; Chen et al., EMBOJ., (1988), 
7 297-302; Chen et al., Dev. Genet., (1989), 10, 1 12-122; Naito et al., Plant Mol. Biol., 
0988) 11 109-123] in transgenic plants, since: a) there is very little position effect on their 

1 5 expression in transgenic seeds, and b) the two promoters show different temporal regulat.on: 
the promoter for the a'-subunit gene is expressed a few days before that for the P-subunit 
gene. 

Also of particular use in the expression of the nucleic acid fragments of the invention 
will be the promoters from several extensively characterized corn seed storage protein genes 

20 such as endosperm-specific promoters from the 10 kD zein [Kirihara et al., Gene, (1988), 71, 
359-370], the 27 kD zein [Prat et al., Gene, (1987), 52, 51-49; Gallardo et al., Plant Sci., 
(1988), 54, 21 1-281], and the 19 kD zein [Marks et al., J. Biol. Chem., (1985), 260, 
16451-16459] The relative transcriptional activities of these promoters in corn have been 
reported [Kodrzyck et al., Plant Cell, (1989), 1, 105-1 14] providing a basis for choosing a 

25 promoter for use in chimeric gene constructs for corn. For expression in corn embryos, the 
strong embryo-specific promoter from the GLB1 gene [Kriz, Biochemical Genetics, (1989), 
27 239-251, Wallace et al., Plant Physiol, (1991), 95, 973-975] can be used. 

It is envisioned that the introduction of enhancers or enhancer-like elements into 
other promoter constructs will also provide increased levels of primary transcription for MS, 

30 CS AK or AK-HDH and methionine-rich storage protein genes to accomplish the invention. 
These would include viral enhancers such as that found in the 35S promoter [Odell et al., 
Plant Mol. Biol, (1988), 10, 263-272], enhancers from the opine genes [Fromm et al., Plant 
Cell (1989) / 977-984], or enhancers from any other source that result in increased 
transcription when placed into a promoter operably linked to the nucleic acid fragment of the 

35 invention. 

Of particular importance is the DNA sequence element isolated from the gene for the 
a'-subunit of p-conglycinin that can confer 40-fold seed-specific enhancement to a 
constitutive promoter [Chen et al., EMBOJ, (1988), 7, 297-302; Chen et al., Dev. Genet., 
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(1989) 70, 1 12-122]. One skilled in the art can readily isolate this element and insert it 
within 'the promoter region of any gene in order to obtain seed-specific enhanced expression 
with the promoter in transgenic plants. Insertion of such an element in any seed-specific 
gene that is expressed at different times than the [3-conglycinin gene will result in expression 
5 in transgenic plants for a longer period during seed development. 

Any 3' non-coding region capable of providing a polyadenylation signal and other 
regulatory sequences that may be required for the proper expression of the CS and AK 
coding regions can be used to accomplish the invention. This would include the 3' end from 
any storage protein such as the 3' end of the bean phaseolin gene, the 3' end of the soybean 

10 p-conglycinin gene, the 3' end from viral genes such as the 3' end of the 35S or the 19S 

cauliflower mosaic virus transcripts, the 3' end from the opine synthesis genes, the 3' ends of 
ribulose 1 ,5-bisphosphate carboxylase or chlorophyll a/b binding protein, or 3' end sequences 
from any source such that the sequence employed provides the necessary regulatory 
information within its nucleic acid sequence to result in the proper expression of the 

1 5 promoter/coding region combination to which it is operably linked. There are numerous 

examples in the art that teach the usefulness of different 3' non-coding regions [for example, 
see Ingelbrecht et al., Plant Cell, (1989), 1, 671-680]. 

DNA sequences coding for intracellular localization sequences may be added to the 
AK or AK-HDH coding sequence if required for the proper expression of the proteins to 

20 accomplish the invention. Plant amino acid biosynthetic enzymes are known to be localized 
in the chloroplasts and therefore are synthesized with a chloroplast targeting signal. The 
plant-derived MS, CS and methionine-rich storage protein coding sequences include the 
native intracellular targeting signals, but bacterial proteins such as E. coli AKIII and 
AKII-HDHII have no such signal. A chloroplast transit sequence could, therefore, be fused 

25 to the coding sequence. Preferred chloroplast transit sequences are those of the small subunit 
of ribulose 1 ,5-bisphosphate carboxylase, e.g. from soybean [Berry-Lowe et al.,./. Mol. 
Appl. Genet, (1982), 1, 483-498] for use in dicotyledonous plants and from corn [Lebrun et 
al., Nucleic Acids Res., (1987), 75, 4360] for use in monocotyledonous plants. 
Introduction of Chimeric Genes into Plants 

30 Various methods of introducing a DNA sequence into eukaryotic cells (i.e., of 

transformation) of higher plants are available to those skilled in the art (see EPO publications 
0 295 959 A2 and 0 138 341 Al). Such methods include those based on transformation 
vectors utilizing the Ti and Ri plasmids of Agrobacterium spp. It is particularly preferred to 
use the binary type of these vectors. Ti-derived vectors transform a wide variety of higher 

35 plants, including monocotyledonous and dicotyledonous plants, such as soybean, cotton and 
rape [Pacciotti et al., Bio/Technology, (1985), 3, 241; Byrne et al., Plant Cell, Tissue and 
Organ Culture, (1987), 8, 3; Sukhapinda et al., Plant Mol. Biol., (1987), 5,209-216; Lorz et 
aUMo/. Gen. Genet., (1985), 199, 178; Potrykus, Mol. Gen. Genet., (1985), 199, 183]. 
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Other transformation methods are available to those skilled in the art, such as direct 
uptake of foreign DNA constructs [see EPO publication 0 295 959 A2], techniques of 
electroporation [see Fromm et al., Nature (London), (1986), 319, 791] or high-veloc.ty 
ballistic bombardment with metal particles coated with the nucleic acid constructs [see Kline 
5 et al., Nature (London), (1987), 327, 70, and see U.S. Patent No. 4,945,050]. Once 
transformed, the cells can be regenerated by those skilled in the art. 

Of particular relevance are the methods to transform foreign genes into commercially 
important crops, such as rapeseed [see De Block et al., Plant Physiol., (1989), 91, 694-701], 
sunflower [Everett et al., Bio/Technology, (1987), 5, 1201], soybean [McCabe et al., 

10 Bio/Technology, (1988), 6, 923; Hinchee et al., Bio/Technology, (1988), 6, 915; Chee et al., 
Plant Physiol., (1989), 91, 1212-1218; Christou et al., Proc. Natl. Acad. Sci USA, (1989), 
86, 7500-7504; EPO Publication 0 301 749 A2], and corn [Gordon-Kamm et al., Plant Cell, 
(1990), 2, 603-618; Fromm et al, Biotechnology, (1990), 8, 833-839]. 

There are a number of methods that can be used to obtain nucleic acid fragments and 

1 5 plants containing multiple chimeric genes of this invention. Chimeric genes for seed- 
specific expression of MS, CS, AK or AKHDH and methionine-rich storage proteins can be 
linked on a single nucleic acid fragment which can be used for transformation. Plants 
wherein two or more chimeric genes are linked on a nucleic acid fragment integrated into a 
plant chromosome are selected. In another method two or more of the MS, CS, AK or 

20 AKHDH and methionine-rich storage protein chimeric genes, carried on separate DNA 
fragments, are co-transformed into the target plant and transgenic plants carrying two or 
more chimeric genes linked on a nucleic acid fragment integrated into a plant chromosome 
are selected. Alternatively, a plant transformed with an MS chimeric gene can be crossed 
with a plant transformed with a CS, AK or AKHDH and/or a methionine-rich storage protein 

25 chimeric gene, and hybrid plants carrying two or more chimeric genes can be selected. In 
yet another method a plant transformed with one of the chimeric genes is re-transformed 
with another chimeric gene or genes. 
Ex pression of Chimeric Genes in Tra nsformed Plants 

To analyze for expression of the chimeric MS, CS, AK, AK-HDH and methionine- 

30 rich storage protein gene in seeds and for the consequences of expression on the amino acid 
content in the seeds, a seed meal can be prepared by any suitable method. The seed meal can 
be partially or completely defatted, via hexane extraction for example, if desired. Protein 
extracts can be prepared from the meal and analyzed for MS, CS, AK or HDH enzyme 
activities. Alternatively the presence of any of the proteins can be tested for 

3 5 immunologically by methods well-known to those skilled in the art. To measure free amino 
acid composition of the seeds, free amino acids can be extracted from the meal by methods 
known to those skilled in the art [for example, Bieleski et al.,Anal. Biochem., (1966), 17, 
278-293]. Amino acid composition can then be determined using any commercially 
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available amino acid analyzer. To measure total amino acid ^ ™* 

containing both protein-bound and free amino acids can be ^"'^ 
prot e in -bound amino acids and the composition can then be determmed »™ 
commercially avai.able amino acid analyzer. Seeds expressmg the MS, CS, AK, AK HDH 
a„Z methionine-rich storage proteins and with higher methionine content than the w,.d 

type seeds can thus be identified and propagated. 

F.XAMPLES 

The present invention is further defined in the blowing Examples 
and percentages are by weigh, and degrees are Celsius, unless otherw.se stated. 1. should be 
"cod that theseLamples, wh„e indicating preferred 

are given by way of illustration only. From the above d.scuss.on and these Examp es, one 
Z Z n the art can ascertain the essential characteristics of this invent.on, and w.thout 
tZZ the spirit and scope thereof, can make various changes and mod.ficat.ons of 
the invention to adapt it to various usages and conditions. 

FX AMPLE 1 
Tcnlatmn of Plant MS Genes 
cDNA libraries representing mRNAs from various corn, rice, soybean and wheat 
tissues were prepared. The characteristics of the libraries are described below. 
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Library 



TABLE 2 

cDNA Libraries fro m Corn, Rice . Soybean and Wheat 

Tissue 



Clone 



np 

p0026 
s2 

sdc2c 
sdp2c 



sdp3c 



Young Tobacco Green Seedling 

Corn Regenerating Callus 5 Days After 

Auxin Removal 

Soybean Seed, 19 Days After Flowering 

Soybean Developing Cotyledon (6-7 mm) 
Soybean Developing Pod (6-7 mm) 



Soybean Developing Pod (8-9 mm) 



ses4d 

sfll 

srm 

ssm 

wlln 

wlm96 



np.2d06.sk20 
p0026.ccras26rb 

s2.17bl0 
s2.17c08 

sdc2c.pk001.g7 

sdp2c.pk001.e21 
sdp2c.pk001.n20 
sdp2c.pk01 2.121 
sdp2c.pk013.dl2 
sdp2c.pk042.gl8 

sdp3c.pk001.j3 
sdp3c.pk006.n23 
sdp3c.pk020.il 0 

ses4d.pk0010.fl0 



sfll.pkl29.j22 
srm.pk0037.h2 
ssm.pk0070.h6 
wlln.pk0038.e8 
wlm96.pk0018.cl0 



Soybean Embryogenic Suspension 4 Days 
After Subculture 
Soybean Immature Flower 
Soybean Root Meristem 
Soybean Shoot Meristem 
Wheat Leaf From 7 Day Old Seedling* 
Wheat Seedling 96 Hours After Inoculation 

With Erysiphe zraminis f. sp tritici 

*This library was normalized essentially as described in U.S. Patent No. 5,482,845, 
incorporated herein by reference. 

cDNA libraries may be prepared by any one of many methods available. For 
example, the cDNAs may be introduced into plasmid vectors by first preparing the cDNA 
libraries in Uni-ZAP* XR vectors according to the manufacturer's protocol (Stratagene 
Cloning Systems, La Jolla, CA). The Uni-ZAP* XR libraries are converted into plasmid 
libraries according to the protocol provided by Stratagene. Upon conversion, cDNA inserts 
will be contained in the plasmid vector pBluescript. In addition, the cDNAs may be 
introduced directly into precut Bluescript II SK(+) vectors (Stratagene) using T4 DNA hgase 
(New England Biolabs), followed by transfection into DH10B cells according to the 
manufacturer's protocol (GIBCO BRL Products). Once the cDNA inserts are in plasmid 
vectors, plasmid DNAs are prepared from randomly picked bacterial colonies containing 
recombinant pBluescript plasmids, or the insert cDNA sequences are amplified via 
polymerase chain reaction using primers specific for vector sequences flanking the inserted 
cDNA sequences. Amplified insert DNAs or plasmid DNAs are sequenced in dye-pr.mer 
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sequencing reactions to generate partial cDNA sequences (expressed sequence tags or 
"ESTs"- see Adams et al., (1991) Science 252:1651). The resulting ESTs are analyzed us.ng 
a Perkin Elmer Model 377 fluorescent sequencer. Complete nucleotide sequence of the 
cDNAs may be determined using a ABI Model 373 A DN A sequencer. 

cDNA clones encoding methionine synthase were identified by conducting BLAST 
(Basic Local Alignment Search Tool; Altschul et al. (1993) J. Mol Biol 275:403-410; see 
also www.ncbi.nlm.nih.gov/BLAST/) searches for similarity to sequences contained m the 
BLAST "nr" database (comprising all non-redundant GenBank CDS translations, sequences 
derived from the 3-dimensional structure Brookhaven Protein Data Bank, the last major 
release of the SWISS-PROT protein sequence database, EMBL, and DDBJ databases). The 
cDNA sequences obtained in Example 1 were analyzed for similarity to all publicly 
available DNA sequences contained in the "nr" database using the BLASTN algorithm 
provided by the National Center for Biotechnology Information (NCBI). The DNA 
sequences were translated in all reading frames and compared for similarity to all publicly 
available protein sequences contained in the "nr" database using the BLASTX algorithm 
(Gish and States (1993) Nature Genetics 3:266-272) provided by the NCBI. For 
convenience, the P-value (probability) of observing a match of a cDNA sequence to a 
sequence contained in the searched databases merely by chance as calculated by BLAST are 
reported herein as "pLog" values, which represent the negative of the logarithm of the 
reported P-value. Accordingly, the greater the pLog value, the greater the likelihood that the 
cDNA sequence and the BLAST "hit" represent homologous proteins. 

A tobacco cDNA library was constructed using RNA derived from young green 
seedlings The RNA was sent to Stratagene Cloning Systems (La Jolla, CA) for the custom 
synthesis of a cDNA library in the Lambda Uni-Zap™ XR vector. Randomly picked 
25 individual cDNA inserts were amplified from phage DNA via PCR and the DNA was 

sequenced using a ABI Model 373A DNA sequencer. The DNA sequences were analyzed 
for similarity to all publicly available previous DNA sequences in the GeneBank Database 
using the BLASTN algorithm provided by the National Center for Biotechnology 
Information (NCBI). The DNA sequences were translated in all reading frames and 
compared for similarity to all publicly available previous protein sequences in the GeneBank 
Database using the BLASTX algorithm provided by the NCBI. 

The BLASTX search using clone np.2d06.sk20 revealed unmistakable similarity of 
the protein encoded by the DNA to E. coli MS and yeast MS. The amino acid sequence 
similarity began essentially at the start of both the E coli and yeast proteins indicating that 
the tobacco cDNA was likely to be a nearly full length cDNA. E. coli MS is a protein of 753 
amino acids and yeast MS contains 767 amino acids. Thus, the coding region of the tobacco 
MS would be expected to be 2250-2300 nucleotides long. A plasmid-borne vector carrying 
the tobacco MS cDNA insert was excised from the lambda phage using the standard lambda- 
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zap procedure provided by Stratagene and designated P BT77 1 . The ampicillin-resistant 
plasmid carried the cDNA insert in the vector pBluescript SK(-). Restriction endonuclease 
digests of the plasmid indicated that the cDNA insert was about 2.6 kb, thus long enough to 
encode a complete tobacco MS protein. 

The complete nucleotide sequence of the full length tobacco MS cDNA clone was 
determined using a ABI Model 373A DNA sequencer. SEQ ID NO:5 shows the nucleotide 
sequence of the tobacco MS cDNA and the corresponding amino acid sequence of the 
tobacco MS protein. The amino acid sequence of tobacco MS shows approximately 44% 
sequence similarity to either the E. coli or yeast proteins. 

Similarly, a corn cDNA library was constructed using RNA derived from developing 
kernels 15 days after pollination. The RNA was sent to Stratagene Cloning Systems 
(La Jolla CA) for the custom synthesis of a cDNA library in the Lambda Uni-Zap™ XR 
vector, randomly picked individual cDNA inserts were amplified from phage DNA via PCR, 
and the DNA was sequenced using a ABI Model 373A DNA sequencer. The DNA 
1 5 sequences were analyzed as described above. 

The BLASTX search using clone m .15.4.c03.sk20 revealed unmistakable similarity 
of the protein encoded by the DNA to E coli MS, yeast MS and tobacco MS (Figure 1). The 
amino acid sequence similarity began in the middle of both the E. coli and yeast proteins 
indicating that the corn cDNA was not a full length cDNA. The partial corn MS amino acid 
20 sequence shows 45% similarity to E. coli MS, 48% similarity to yeast MS and 61% 
similarity to tobacco MS. Using similar methods, another corn cDNA clone, 
P 0026.ccras26rb, was found to encode a full-length methionine synthase. The sequence of 
this corn MS cDNA is shown in SEQ ID NO: 1 . 

Soybean cDNA libraries were constructed using RNA derived from developing 
25 seeds. The RNA was used to create cDNA libraries in the Lambda™Uni-Zap XR vector; 
bacterial plasmid cDNA libraries were derived from the phages using the Lambda-Zap 
procedure. Randomly picked individual cDNA clones were amplified from bacterial DNA 
via PCR, using primers complimentary to plasmid DNA that flanked the cDNA insert. The 
DNA was sequenced using a ABI Model 377 DNA sequences, and the DNA sequences were 

30 analyzed as described above. 

The BLASTX search using clones s2.09c02, s2.11g02, s2.17M0, s2.17c08, s2.17f07 
and se2 1 lfl2 each revealed unmistakably similarity of the protein encoded by the DNA to 
E. coli MS, yeast MS, tobacco MS and Catharanthus roseus MS. A contiguous sequence 
was constructed using the 6 cDNA sequences above. The sequence of this soybean MS 

35 cDNA is shown in SEQ ID NO:55. The deduced amino acid sequence of the soybean cDNA 
showed strong similarity to the carboxy half of the E. coli, yeast and especially the tobacco 
and Catharanthus roseus MS proteins (Figure 1). A longer contig from various soybean 
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clones (Table 1) was assembled to encode a foil-length methionine synthase, the nucleotide 
sequence ofwhich is shown in SEQ ID NO:3. 

The BLASTX search using the sequences from clones listed in Table 3 revealed 
similarity of the polypeptides encoded by the cDNAs to methionine ^ ^ *^ 
plant species. Shown in Table 3 are the BLAST results for .nd.v.dual ESTs ( EST , the 
sequences of the entire cDNA inserts comprising the indicated cDNA clones ( FIS ), or 
contigs assembled from two or more ESTs ("Contig"): 

TABLE 3 

BLAST Results for Sequences Encoding Polypeptide, Homolopous to Methionine Synthase 
" — " " BLAST Results 



Clone 



Status 



Organism 



General pLog 
Identification No. Score_ 



p0026.ccras26rb 

Contig of 
s2.17c08 
sdc2c.pk001.g7 
sdp2c.pk001.e21 
sdp2c.pk001.n20 
sdp2c.pk0 12.121 
sdp2c.pk013.dl2 
sdp2c.pk042.gl8 
sdp3c.pk001.j3 
sdp3c.pk006.n23 
sdp3c.pk020.il0 
ses4d.pk0010.fl0 
sfll.pkl29.j22 
srm.pk0037.h2 
ssm.pk0070.h6 

np.2d06.sk20 

wlm96.pk0018.cl0 

wlln.pk0038.e8 



FIS 
Contig 



Catharanthus roseus 
Arabidopsis thaliana 



1362086 
2738248 



>254.00 
>254.00 



FIS 
EST 

EST 



Catharanthus roseus 
Mesembryanthemum 
crystallinum 
Catharanthus roseus 



1362086 
1814403 

1362086 



>254.00 
70.40 

29.30 
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Figure 1 presents an alignment of the amino acid sequences set forth in SEQ ID 
NOs-2 4 6 8 and 10 and the Catharanthus roseus sequence (SEQ ID NO:l 1). Thedatam 
Table ; represents a calculation of the percent identity of the amino 

in SEQ ID NOs:2, 4, 6, 8 and 10 and the Catharanthus roseus sequence (SEQ ID NO.l 1). 
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T ABLE 4 



SEQ ID NO. 



2 
4 
6 
8 

10 



Percent Identity to 

87.5 
87.5 
92.4 
84.7 
57.8 



5 Sequence alignments and percent identity calculations were performed using the 

Megalign program of the LASERGENE bioinformatics computing suite (DNASTAR Inc., 
Madison, WI). Multiple alignment of the sequences was performed using the Clusta 
method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default 
parameters (GAP PENALTY=10, GAP LENGTH PEN ALT Y= 10). Default parameters for 
10 pairwise alignments using the Clustal method were KTUPLE 1. GAP PENALTY-3. 

WINDOWS and DIAGONALS SAVED=5. Sequence alignments and BLAST scores and 
probabilities indicate that the nucleic acid fragments comprising the instant cDNA clones 
encode a substantial portion of a methionine synthase. These sequences represent the first 
monocot (corn), soybean and tobacco cDNA sequences encoding full-length methionine 
15 synthase as well as the first wheat partial cDNA sequences encoding methionine synthase. 

EXAMPLE 2 

r^etT-n^n n f Chimeric Genes for Fxpression in E. coli 
The tobacco MS gene was modified to permit the construction of chimeric genes for 
expression in E. coli and plant seeds. First, a Bs P H I site (TCATGA) was »^u^ at the 
20 ATG start codon using oligonucleotides CF49 and (SEQ ID NO:12) CF50 (SEQ ID NO. 1 3). 
The oligonucleotides were annealed and inserted into pBT771 digested with EcoR I and 
EcoR V This takes advantage of the unique EcoR I site at the junction of the vector and 
cDNA and a unique EcoR V site about 20bp from the start codon. The result of this 
insertion is to remove the cDNA sequences upstream of the ATG start codon and to alter the 

25 second codon of the tobacco MS gene from GCA encoding alanine to AC A encoding 
threonine. Since threonine is the second amino acid in E. coli MS, the substitution of 
threonine for alanine in tobacco MS is not expected to affect the protein function. Insertion 
of an oligonucleotide with the correct sequence into this region was confirmed by DNA 
sequencing, yielding plasmid pBT772. 

30 Next a Kpn I site was added immediately following the translation stop codon. This 

was accomplished by using PCR employing P BT771 DNA as template and pnmers CF5 1 
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(SEQ ID N0:.4) - CF52 (SEQ .D NO:,5, .o generate a mod, fi e 280 base pa ragm n, 
hat was d.gested with Rsr ,. and Kpn , and mserted into similar* ^^Zl^ls 
DNA fragment replacement removes the 3' non-codmg sequences present n he tob 
La. insertion of a DNA fragment with the correct sequence was confirmed DN A 
sequencing, yielding plasmid P BT773, which contams a umque 2306 bp BspH Kp 
fragment carrying the tobacco MS coding region only. 

g To Z£ high ,eve, express.on of the tobacco MS gene in E ~ * ™£ 
version bacteria, express vector P ET-3a (Rosenberg e. a,., ^^^m 
was used. This expression vector employs the bacteriophage T7 RNA 

sssasssssas^gst 

mutagenesis The DNA sequence of pET-3aM in this region, 5 -CATATOU, 
Ts -C^IOG creating P BT430. Then pBT430 was .rther — ol^ MS g^ 
site downstream of the Nco I site a. the translation mmat.on codon. The ~ ^ 8 
was cut out of P BT773 as a 2300 bp BspH I-Kpn 1 fragment and mserted ,nto the 
described expression vector digested with Nco I and Kpn 1. 

For hlh level expression, a plasmid clone with the cDNA msert ,n the correct 
orientation relative to the T7 promoter can be transformed into E ~" 

T^zs^zz^"*** - be addea ,o a fmai conc r ion of 

l,IPlO(isopropynn harvested by 

0.4 mM and incubation can be continued fo , 3 h | » • C ^ mM 

centrifogation and re-suspended m 50 uL of 50 mM Tns HC1 P 

OTT and 0.2 mM phenyl methylsulfonyl fluoride^ £^^£^1***. 

One M of protem from ^^^t^T^ 
SDS-polyacrylamide gel electrophoresis. Gels can be oosei^ 
at the expected molecular weight. 

FX AMPLE 3 
Nation of a Plant CS Gene 
,n order to clone the corn CS gene, RN A was isolated from developing seeds , „ ^corn 
„ „• .■ ThicRNA was sent to Clontech Laboratories, Inc., (Palo 

was accomplished following the protocol provided by Clontech. unc 
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p.asmid Hbrary the ampicillin-resistant clones obtained carry .he cDNA insert in the vector 
prescript 7u-l Expression of the cDNA is under contro. of the ,acZ promoter on the 



vector 



Two phagemid libraries were generated using the mixtures of the Lambda Zap 
pbage and the filamentous helper phage of 1 00 mL to 1 mL. Two additional hbranes were 
generated using mixtures of 100 pL Lambda Zap I. to 1 0 pL helper phage and 20 pL 
Lambda Zap [1 to 10 pL helper phage. The titers of the phagemid prepara.tons we e 
regardless of the mixture used and were about 2 x , 0 3 ampicillin-reststant-transfectants per 
mL with E. coli strain XL 1 -Blue as the host. 

To identify clones that carried the CS gene, B. coU strain BOB105 
by introducing the F p.asmid from K coU strain XL,-b,ue into ^J™ 5 ^™ 
FEMS Microbio.. Let,. 21 :189] by conjugation. The genotype of BOBI05 „: F .JQUfl 

laclq D0tf)M15/talA37 mSIfiL The strain requires f"^" 
station in me mSiS gene that encodes CS. Functional express.on °^J^™ 
should complement the mutation and allow the strain to grow m the absence o m th o n m . 

To select for clones from the com cDNA library that earned the CS gene 1 00 pL of 
the phagemid library was mixed with 300 pL of an overnight culture of BOB.05 grown ,n L 
broth and incubated at 37° for 1 5 min. The cells were collected by centnfirgat.on, 
resuspended in 400 pL of M9 + vitamin B 1 broth and plated on M9 medta con.atntng 
vitamin Bl. glucose as a carbon and energy source, 20 pL threonine (to prevent the 
possibility of threonine starvation due to overexpression of CS), 100 pg m ; 
20 pg/mL tetracycline, and 0.16 mM IPTG (isopropylthio-P-galac.ostde). Ftfteen plates 
were prepared and incubated a, 37"C. The amount of phagemid added was expected to yteld 
about 2X10* ampici.lin-resis.an, transfectan, ^fecanrs) able 

Approximately 30 colon.es (an average of 2 per plate or 1 per .u / 
to grow in the absence of methionine were obtained. No colonies were observed ,f .he 
ph!gemids carrying the com cDNA library were no, added. Twelve c °-s were p.exed and 
colony purified by streaking on the same medium described above. Plasm, I DNA 
isolated from the 12 Cones and retransformed into BOB105. All of the .2 DNAs ye.ded 
methionine-independent transformants demonstrating that a plasmid-bome gene was 
responsible forthe phenotype. The sequence of the DNA insert in one of the P", 
FS1088, is shown in SEQ ID NO:16 and the deduced amino acid sequence of the o. , CS 
protein derived from the nucleotide sequence of SEQ ID NO:16 is shown tn SEQ ID N O.17. 
It is 1639 bp in length and contains a long open reading frame and a poly A tatl, md.catmg 
that i, too represents a corn cDNA. The deduced amino acid sequence of the open readtng 
frame shows 59 percent similarity and 34 percent identity to the published sequence of 
E coli CS, indicating that it represents a corn homolog ,o the £. coh metB gene. 
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The open reading frame in plasmid FS1088 continues to the end of the insert DNA, 
and does not include an ATG initiator codon, indicating that the cloned cDNA is incomplete. 
The open reading frame of FS 1088 is in frame with the initiator codon of the jacZ gene 
carried on the cloning vector. Thus, complementation of the metB mutation m BOB105 
5 results from expression of a fusion protein including 39 amino acids from P -galactosidase 
and the vector polylinker attached to the truncated corn CS protein. 

In order to clone the entire 5' end of the corn CS gene, the cDNA clone was used as a 
DNA hybridization probe to screen a genomic corn library. A genomic library of corn m 
bacteriophage lambda was purchased from Stratagene (La Jolla, California). Data sheets 
1 0 from the supplier indicated that the corn DNA was from etiolated Missouri 1 7 corn 

seedlings The vector was Lambda FIX- II carrying Xho I fragments 9-23 kb in size. A 
titer of 1 0 x 1010 plaque forming units ( P fu)/mL in the amplified stock was indicated by the 
supplier when purchased. Prior to screening, the library was re-titered and contained 2.0 x 

10 8 pfu/mL. .... 

, 5 The protocol used for screening the library by DNA hybridization was prov.ded by 

Clonetech (Palo Alto , California). From autoradiograms of duplicate filters, 1 1 plaques 
which hybridized to a corn CS cDNA probe were identified. After a second round of 
screening two of the original plaques, number 6-1 and number 10-1, showed posit.ve 
hybridization. These plaques were tested with the probe a third time; and well isolated 

20 plaques were picked from each original. Following a fourth probing all the plaques 
hybridized, indicating that pure clones had been isolated. 

DNA was prepared from these two phage clones, 6-1 and 10-1, using the protocol for 
plate lysate method [see Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press]. Restriction endonuclease digests and agarose gel 

25 electrophoresis showed the two clones to be identical. The DNA fragments from the agarose 
gel were "Southern-blotted" [see Sambrook et al. (1989) Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press] onto nylon filters and probed with labeled 
corn CS cDNA. A single 7.5 kb Sal I fragment and two Xba I fragments of 3.6 kb and 
3 2 kb hybridized to the probe. The 3.2 kb Xba I fragment hybridized weakly to the probe 

30 whereas the 3.6 kb Xba I and the 7.5 kb Sal I fragments hybridized strongly. 

The 3 6 kb Xba I fragment was cloned into the Xba I site of P GEM®-9Zf(-) that had 
been treated with calf intestinal alkaline phosphatase. Two subclones from each Xba I 
fragment with the fragments in both orientations with respect to pGEM®-9Zf(-) DNA were 
obtained following transformation of E. coli. The two 3.6 kb Xba I subclones were 

35 designated FS 1 1 79 and FS 11 80. 

Restriction enzyme analysis of the subclones indicated that the 3.6 kb Xba I fragment 
in FS1 179 and FS1 180 included the 5' region of the corn CS gene. DNA from FS1 1 80 was 
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sent to LARK Sequencing Technologies Inc. (Houston, TX) for complete DNA sequencing 
analysis The sequence of the entire 3639 bp Xba I fragment is shown in see SEQ ID NO:l 8. 

Complete sequence analysis of the 3639 bp Xba I fragment revealed 806 bp of 
sequence upstream from the protein coding region and 2833 bp of DNA encoding two-thuds 
of the corn CS protein. The 2833 bp includes seven exons and seven mtrons with the 3 
Xba I site located in the seventh intron. Table 1 describes the location and length of exons 
and introns in the sequence as well the number of amino acids encoded by the exons. The 
first exon includes the entire chloroplast targeting signal and 12 amino acids into the re gl on 
that shows amino acid sequence alignment with the E. coli protein. The last codon m Exon 7 
encodes amino acid 333 of corn CS as shown in SEQ ID NO: 17. 



TABLE 5 



FROM (bp) 
1 

807 
1195 
1302 
1406 
1490 
1564 
1647 
1816 
2508 
2568 
2661 
2865 
2948 
3035 



T O (bp) 
806 
1194 
1301 
1405 
1489 
1563 
1646 
1815 
2507 
2567 
2660 
2864 
2947 
3034 
3639 



REGION 
Promoter 
Exonl 
Intron 1 
Exon2 
Intron2 
Exon3 
Intron3 
Exon4 
Intron4 
Exon5 
IntronS 
Exon6 
Intron6 
Exon7 
Intron7 
"na = not applicable 

Comparison of the corn CS cDNA sequence to the genomic CS DNA sequence 
indicated that the cDNA of clone FS1088 did not contain the entire chloroplast targeting 
signal as anticipated. The cDNA was not truncated on the 5' end, but instead contained a 
1 70 bp deletion in the chloroplast transit sequence. 

The complete amino acid sequence of the corn CS protein derived from combining 
the amino terminal sequence deduced from the corn genomic DNA fragment of SEQ ID 



Len gth (bp) 
806 
387 
106 
103 
83 
73 
82 
168 
691 
59 
92 
203 
82 
86 
>604 



Number of 
Encoded 
Ami no A cids 

na* 

129 

na 

35 

na 

24 

na 

57 

na 

20 

na 

68 

na 

29 

na 
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NO:! 8 and .he carboxy terminal sequence from the com cDNA fragment of SEQ ID NO:16 
is shown in SEQ ID NO: 19. 

FX AMPLE 4 

^Hifi.a.ion nf the O- ™ "ene and Hirh level expression in E. co/ j 
As indicated in Example 2, the open reading frame in plasmid FS *« »™ 
CS gene does not include an ATG initiator codon. Oligonucleotide adaptors OTG145 and 
OTG146 were designed to add an initiator codon in frame with the CS coding sequence. 

OTG145' S'-AATTCATGAG TGCA-3' SEQIDNO:20 

10 OTG146 5'-AATTTGCACT CATG-3' SEQIDNO.21 

When annealed the oligonucleotides possess EcoR I sticky ends. Upon insertion into 
FS1088 in the desired orientation, an EcoR I site is present at the 5" end of *e «d^or, tta 
, 5 ATG initiator codon is within a BspH I restriction endonuclease site, and the EcoR stte 
the 3' end of the adaptor is destroyed. The oligonucleotides were ligated mto EcoR I 
digested F S1 088, and insertion of the correct sequence in the desired orientat.on was venfted 

by DNA sequencing. 

To achieve high level expression of the corn CS gene in M. col, the bacterial 
20 expression vector P BT430 (see Example 2) was used. The corn CS gene was cut out of the 
modified FS1088 plasmid described above as an 1482 bp BspH 1 fragment and inserted mto 
,he expression vector pBT430 digested with Nco I. Clones with the CS gene in the proper 
orientation were identified by restriction enzyme mapping. 

For high level expression each of the plasmlds was transformed into E. col, strain 
25 BL2,(DE3)orBL21(DE3),ys Sl S,u^^ 

were ro J in LB medium containing ampic.lin (100 mg/L) a. 37°C. At an optical density 
at 600 nm of approximately 1, 1PTG (isopropyl«hio-P-galac.oside, the inducer) was added to 
a final concentration of 0.4 mM and incubation was continued overnight. The cells were 
collected by centrifiigation and resuspended in >/20th the original culture volume in 50 mM 
30 NaCl; 50 mM Tris-Cl, pH 7.5; 1 mM EDTA, and frozen at -20°C. Frozen ahquots of 1 mL 
were thawed at 37°C and sonicated, in an ice-water bath, to lyse the cells. The lysa e , was 
centriftiged a. 4»C for 5 min at 12,000 rpm. The supernatant was removed and the pellet 
resuspended in 1 mL of the above buffer. 

The supernatant and pellet fractions of uninduced and IPTG-induced cultures were 
35 analyzed by SDS polyacrylamide gel electrophoresis. The best of the 

the induced culture of the BL2,(DE3)lysS host. The major protetn visible by <^»»« 
blue staining in the pellet fraction of this induced culture had a molecular weigh, of about 
54 kd, the expected size for corn CS. 
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F Y AMPLE 5 
T^winn nf the E. coli lysCGeneapdmutati^ 
in ] ytC resultip p in l ysine-inse.nsitive AKIU 
The E coli teC gene has been Coned, restriction endonuclease mapped and 
sequenced previously [Cassan e. al., J. Biol. Chen,., (1986), 261, 1052-10 57]. Forthe 

ordered hbrary of 3400 overlapp ing ^-ents o^ ^/- LW, CO * ^ 

Akiyama and Isono Kohara et al., Ce», (198/), W . . 

phyL. map of the whole £ coli chromosome and ties the phys.cal map to the genehc map. 
Zl the knowledge of the map position of taC a. 90 min. on the £ genet.c map 
[Thezeetal.,./. (1974), 777, ,33-143], the restriction endonuclease map of the 

Led gene [Cassan et al., J. «rt Chen,., (1986), 2o7, 1052-1057], and the restnc ton 
lonuclease map of the cloned ONA fragments in the E.coli hWy [Kohara ^O* 
(1987) 50 595-508], it was possible to choose lambda phages 4E5 and 7A4 [Kohara al., 
Ce7(1987) 50 59 -508] as likely candidates for carrying the toC gene. The phages were 
™1 in H Uu.ture from single places as described [see C-P— * 
Llogy (1987) Ausube. et al. eds. John Wiley & Sons New York] us.ng LE392 as host [see 
Samblk et a,. (1989) Mo,ecu,ar Cloning: A Laboratory Manual, Cold Spnng Harbor 
Laboratory Press], Phage DNA was prepared by phenol extracon as descnbed [see Cun-ent 
Pro.oco.sl Molecular Biology (1987) Ausubel e, al. eds. John Wiley ft Sons New York . 

From the sequence of the gene several restriction endonuclease fragments d.agnost.c 
for the iisC gene were predicted, including an .860 bp EcoR 1-Nhe I fragment, a 2140 bp 
EcoR I-Xmn ! fragment and a 1600 bp EcoR .-BamH I fragment. Each of these fragmen s 
was detected in both of the phage DNAs confirming that these carried the !ysC gen. The 
EcoR 1-Nhe I fragment was isolated and subcloned in plasmid P BR322 d.gested w,,h the 
same enzymes, yielding an ampicillin-resistant, tetracycline-sensitive E. coli transformant. 

The plasmid was designated pBT436. 

To establish that the cloned lysC gene was functional, pBT436 was transformed mto 
E. coli strain Gifl06M. (£ coli Genetic Stock Center strain CGSC-5074) which has 
mutations in each of the three E. coli AK genes [Theze e. al., J. BaCeriol., (1974), 1 7, 
1 33-143] This strain lacks all AK activity and therefore requtres diaminop.melate (a 
precursor to lysine which is also essential for cell wall biosynthesis), threomne and 
methionine. In the transformed strain all these nutritional requirements were reheved 
demonstrating that the cloned !yjC gene encoded functional AKII1. 

Addition of lysine (or diaminopimelate which is readily converted to lysme m v.vo) 
a, a concentration of approximately 0.2 mM ,0 the growth medium inhibit : the growth of 
Gin06M, transformed with pBT436. M9 media [see Sambrook et a.. (.989 Mo.ecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press] supplemented w,th 
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the arginine and isoleucine, required for Gifl06Ml growth, and ampicilhn, to maintain 
selection for the pBT436 plasmid, was used. This inhibition is reversed by addition of 
threonine plus methionine to the growth media. These results indicated that AKIII could be 
inhibited by exogenously added lysine leading to starvation for the other amino acids derived 
5 from aspartate. This property of P BT436-transformed Gifl06Ml was used to select for 
mutations in lysC that encoded ly sine-insensitive AKIII. 

Single colonies of Gifl06Ml transformed with pBT436 were picked and resuspended 
in 200 uL of a mixture of 100 uL 1% lysine plus 100 uL of M9 media. The entire cell 
suspension containing 10M0* cells was spread on a petri dish containing M9 media 

1 0 supplemented with the arginine, isoleucine, and ampicillin. Sixteen petn dishes were thus 
prepared From 1 to 20 colonies appeared on 1 1 of the 16 petri dishes. One or two (if 
available) colonies were picked and retested for lysine resistance and from this nine lys.ne- 
resistant clones were obtained. Plasmid DNA was prepared from eight of these and re- 
transformed into Gifl06Ml to determine whether the lysine resistance determinant was 

15 plasmid-borne. Six of the eight plasmid DNAs yielded lysine-resistant colonies. Three of 
these six carried lysC genes encoding AKIII that was uninhibited by 1 5 mM lysine, whereas 
wild type AKIII is 50% inhibited by 0.3-0.4 mM lysine and >90% inhibited by 1 mM lysine. 

To determine the molecular basis for lysine-resistance the sequences of the wild type 
lysC gene and three mutant genes were determined. The sequence of the wild type lysC 

20 gene cloned in pBT436 (SEQ ID NO:22) differed from the published lysC sequence in the 
coding region at 5 positions. Four of these nucleotide differences were at the third position 
in a codon and would not result in a change in the amino acid sequence of the AKIII protein. 
One of the differences would result in a cysteine to glycine substitution at amino acid 58 of 
AKIII. These differences are probably due to the different strains from which the lysC genes 

25 were cloned. 

The sequences of the three mutant lysC genes that encoded lysine-insensitive AK 
each differed from the wild type sequence by a single nucleotide, resulting in a single amino 
acid substitution in the protein. Mutant M2 had an A substituted for a G at nucleotide 954 of 
SEQ ID NO:22 resulting in an isoleucine for methionine substitution at amino acid 318 and 
30 mutants M3 and M4 had identical T for C substitutions at nucleotide 1055 of SEQ ID NO:22 
resulting in an isoleucine for threonine substitution at amino acid 352. Thus, either of these 
single amino acid substitutions is sufficient to render the AKIII enzyme insensitive to lysine 
inhibition. 

An Nco I (CCATGG) site was inserted at the translation initiation codon of the lysU 
35 gene using the following oligonucleotides: 

5--GATCCATGGC TGAAATTGTT GTCTCCAAAT TTGGCG-3' SEQ ID NO:24 
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5--GTACCGCCAA ATTTGGAGAC AAC AATTTCA GCCATG-3" SEQ ID NO:25 

When annealed these oligonucleotides have BamH I and Asp 718 "sticky" ends. The 
plasmid P BT436 was digested with BamH I, which cuts upstream of the lysC coding 
sequence and Asp 718 which cuts 31 nucleotides downstream of the initiation codon. The 
annealed oligonucleotides were ligated to the plasmid vector and E. calx transformants were 
obtained. Plasmid DNA was prepared and screened for insertion of the oligonucleotides 
based on the presence of an Nco I site. A plasmid containing the site was sequenced to 
assure that the insertion was correct, and was designated pBT457. In addition to creating an 
Nco I site at the initiation codon of lysC, this oligonucleotide insertion changed the second 
codon from TCT, coding for serine, to GCT, coding for alanine. This amino acid 
substitution has no apparent effect on the AKIII enzyme activity. 

The lysC gene was cut out of plasmid P BT457 as a 1 560 bp Nco I-EcoR I fragment 
and inserted into the expression vector pBT430 digested with the same enzymes, yielding 
plasmid P BT461 . For expression of the mutant lyj>C-M4 gene pBT461 was digested with 
Kpn I-EcoR I, which removes the wild type iysC gene from about 30 nucleotides 
downstream from the translation start codon, and inserting the analogous Kpn I-EcoR I 
fragments from the mutant genes yielding plasmid pBT492. 

EXAMPLE 6 
Molecular Cloning of Corn Genes Enco ding 
Methionine-Ric-h Seed Storage Prot eins 
A high methionine 10 kD zein gene [Kirihara et al., Mol. Gen. Genet., (1988), 21 U 
477-484] was isolated from corn genomic DNA using PCR. Two oligonucleotides 30 bases 
long flanking this gene were synthesized using an Applied Biosystems DNA synthesizer. 
25 Oligomer SM56 (SEQ ID NO:26) codes for the positive strand spanning the first ten ammo 
acids: 

SM56 5--ATGGCAGCCA AGATGCTTGC ATTGTTCGCT-3' SEQ ID NO:26 
30 Oligomer CFC77 (SEQ ID NO:27) codes for the negative strand spanning the last ten amino 
acids: 

CFC77 S'-GAATGCAGCA CCAACAAAGG GTTGCTGT A A-3 ' SEQ ID NO:27 

35 These were employed to generate by polymerase chain reaction (PCR) the 1 0 kD 

coding region using maize genomic DNA from strain B85 as the template. PCR was 
performed using a Perkin-Elmer Cetus kit according to the instructions of the vendor on a 
thermocycler manufactured by the same company. The reaction product, when run on a 1 /. 
agarose gel and stained with ethidium bromide, showed a strong DNA band of the size 
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expected for the 10 kD zein gene, 450 bp, with a faint band at about 650 bp. The 450 bp 
band was electro-eluted onto DEAE cellulose membrane (Schleicher & Schuell) and 
subsequently eluted from the membrane at 65°C with 1 M NaCl, 0.1 mM EDTA, 20 mM 
Tris-Cl P H 8 0 The DNA was ethanol precipitated and rinsed with 70% ethanol and dried. 
5 The dried pellet was resuspended in 10 water and an aliquot (usually 1 uL) was used for 
another set of PCR reactions, to generate by asymmetric priming single-stranded linear 
DNAs For this, the primers SM56 and CFC77 were present in a 1 :20 molar ratio and 20: 1 
molar ratio. The products, both positive and negative strands of the 10 kD zein gene, were 
phenol extracted, ethanol precipitated, and passed through NACS (Bethesda Research 
1 0 Laboratories) columns to remove the excess oligomers. The eluates were ethanol 

precipitated twice, rinsed with 70% ethanol, and dried. DNA sequencing was done using the 
appropriate complementary primers and a sequenase kit from United States B.ochcmicals 
Company according to the vendors instructions. The sequence deviated from the published 
coding sequence (Kirihara et al., Gene, (1988), 71, 359-370) in one base pair at nucleotide 
15 position 1504 of the published sequence. An A was changed to a G which resulted in the 
change of amino acid 123 (with the initiator methionine as amino acid 1) from Gin to Arg 
is not known if the detected mutation was generated during the PCR reaction or if this is 
another allele of the maize 10 kD zein gene. A radioactive probe was made by nick- 
translation of the PCR-generated 10 kD zein gene using 32 P -dCTP and a nick-translation kit 
20 purchased from Bethesda Research Laboratories. 

A genomic library of corn in bacteriophage lambda was purchased from Clontech 
(Palo Alto CA). Data sheets from the supplier indicated that the corn DNA was from seven- 
day-old seedlings grown in the dark. The vector was l-EMBL-3 carrying BamHI fragments 
15 kb in average size. A titer of 1 to 9 x 10' plaque forming units (pfu)/mL was indicated by 
25 the supplier. Upon its arrival the library was titered and contained 2.5 x 1 0* pfu/mL. 

The protocol for screening the library by DNA hybridization was provided by the 
vendor. About 30,000 pfu were plated per 1 50-mm plate on a total of 1 5 Luria Broth (LB) 
agar plates giving 450,000 plaques. Plating was done using E. coli LE392 grown in LB + 
0 2% maltose as the host and LB-7.2% agarose as the plating medium. The plaques were 
30 absorbed onto nitrocellulose filters (Millipore HATF, 0.45 mM pore size), denatured in 0.5M 
NaOH neutralized in 1.5 M NaCl, 0.5 M Tris-Cl pH 7.5, and rinsed in 3XSSC [Sambrook et 
al (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press]. 
The filters were blotted on Whatman 3MM paper and heated in a vacuum oven at 80°C for 
two hours to allow firm anchorage of phage DNA in the membranes. 
35 The 32 P -labelled 1 0 kD zein was used as a hybridization probe to screen the library. 

The fifteen 1 50-mm nitrocellulose filters carrying the 1 phage plaques were screened using 
radioactive 10 kD gene probe. After four hours prehybridizing at 60°C in 50XSSPE, 5X 
Denhardt's, [see Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold 
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Spring Harbor Laboratory Press] 0.1% SDS, 100 ug/mL calf thymus DNA, the filters were 
transferred to fresh hybridization mix containing the denatured radiolabeled 10 kD zein gene 
(cpm/mL) and stored overnight at 60°C. They were rinsed the following day under stnngent 
conditions: one hour at room temp in 2XSSC - 0.05% SDS and one hour at 68°C ,n 1XSSC 
5 - 0 1 % SDS Blotting on 3 MM Whatman paper followed, then air drying and 

autoradiography at -70°C with Kodak XAR-5 films with DuPont Cronex® Lightning Plus 
intensifying screens. From these autoradiograms, 20 hybridizing plaques were identified. 
These plaques were picked from the original petri plate and plated out at a dilution to yield 
about 100 plaques per 80-mm plate. These plaques were absorbed to nitrocellulose filters 
1 0 and re-probed using the same procedure. After autoradiography only one of the original 
plaques, number 10, showed two hybridizing plaques. These plaques were tested with the 
probe a third time; all the progeny plaques hybridized, indicating that pure clones had been 
isolated. 

DNA was prepared from these two phage clones, 10-1, 10-2, using the protocol for 
15 DNA isolation from small-scale liquid 1-phage lysates (Ansul et al. (1987) Current Protocols 
in Molecular Biology, pp. 1.12.2, 1.13.5-6). Restriction endonuclease digests and agarose 
gel electrophoresis showed the two clones to be identical. The DNA fragments from the 
agarose gel were "Southern-blotted" [see Sambrook et al. (1989) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press] onto nitrocellulose membrane 
20 filters and probed with radioactively-labeled 1 0 kD zein DNA generated by nick translation. 
A single 7.5 kb BamH I fragment and a single 1 .4 kb Xba I fragment hybridized to the probe. 

The 7 5 kb BamH I fragment was isolated from a BamH I digest of the 1 DNA run on 
an 0.5% low melting point (LMP) agarose gel. The 7.5 kb band was excised, melted, and 
diluted into 0.5 M NaCl and loaded onto a NACS column, which was then washed with 
25 0 5 M NaCl, 1 0 mM Tris-Cl, pH 7.2, 1 mM EDTA and the fragment eluted with 2 M NaCl, 
10 mM Tris-Cl, pH 7.2, 1 mM EDTA. This fragment was ligated to the phagemid pTZl 8R 
(Pharmacia) which had been cleaved with BamH I and treated with calf intestinal alkaline 
phosphatase [see Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratory Press] to prevent ligation of the phagemid to itself. Subclones 
30 with these fragments in both orientations with respect to the pTZl 8R DNA were obtained 
following transformation of E. coli. 

An Xba I digest of the cloned 1 phage DNA was run on an 0.8% agarose gel and a 
1 4 kb fragment was isolated using DEAE cellulose membrane (same procedure as for the 
PCR-generated 10 kD zein DNA fragment described above). This fragment was ligated to 
35 pTZ18R cut with Xba I in the same way as described above. Subclones with these fragments 
in both orientations with respect to the pTZ18R DNA, designated pX8 and pXlO, were 
obtained following transformation of E. coli. Single-stranded DNAs were made from the 
subclones using the protocol provided by Pharmacia. The entire 1.4 kb Xba I fragments 
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were sequenced. An addifional 700 bases adjacen, to the Xba 1 fragment was sequenced 
IZ the BamH I fragment in done P B3 (fragment P B3 is in the same onentat.on as pX8) 
giving a total of 2123 bases of sequence (SEQ ID NO:28). 

Encoded on tbis fragment is another methionine-rich zein, wh.ch ts related ,h 
,0 kD zein and has been designated High Sulfur Zein (HSZ) [see World Paten, 
No WO 92/14822]. From the deduced amino acid sequence of the protem, ,.s molecular 
weigh, is approximately 21 kD and it is about 38% methionine by we.ght. 

FX AMPLE 7 

Momfication of the 11" Gene by -^-Direcled Mutagenesis 
Three Nco I sites were present in the M kD Xba I fragment carrying the HSZ gene, 
all in the HSZ coding region. It was desirable to maintain only one of these s,tes 
(nucleotides 751-756 in SEQ ID NO:28) that included the translate start codon. Therefore, 
be Nco . sites at positions 870-875 and 1333-1338 were eliminated by ohgonucleo.Kie- 
directed site-specific mutagenesis [see Sambrook et al. (1989) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press]. The oligonucleohdes 
synthesized for the mutagenesis were: 

CFC99 S'-ATGAACCCTT GGATGCA-3* SE Q ID NO:3 ° 

CFC98 S'-CCCACAGCAA TGGCGAT-3' SE Q 10 NO:3 1 

Mutagenesis was carried out using a kit purchased from Bio-Rad (Richmond, C A), following 

the protocol provided by the vendor. 

The process changed the A to T at 872 and the C to A at .334. These were both a, 
the third position of their respective codons and resulted in no change in the ^am.nc , ac.d 

... /~i * tn c T still coding for Pro and u U u to o ^ 
«>mi^nre encoded by the gene, withCC AtotLI,suu cuumgi _ 

T 1 1 col for A L The plasmid clone confining the modified HSZ gene ^ a smg e 
Nco I site a, the ATG start codon was designated , pX8m. Because — « has 
uniqueXbaIsiteatthestopcodonofthegene(1384-1389,btyiuiN U j, y 

digest of the DNA with Nco I and Xba 1 yields a 637 bp fragment contammg the enure 
coding sequence of the precursor HSZ polypeptide (SEQ ID NO:32). 

„ was desirable to create a form of the HSZ gene with altemattve umque res.ncOon 
endonuelease sites jus. pas, the end of the coding region. To do this oligonucleofdes 
CFC104 (SEQ ID NO:34) and CFC105 (SEQ ID NO:35): 

CFC104 5'-CTAGCCCGOGTAC -3' (SEQ ID NO:34) 

CFC105 3'- GGGCCCATGGATC-5' (SEQIDNO:35) 
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were annealed and ligated into the Xba I she, introducing two new restncuon s.tes Sma I 
and Kpn I, and destroying the Xba I site. The now unique Xba I site from nucleofde 1 -6 ,n 
SEQ ID NO:28 and the Ssp I site from nuc.eotide 1 823- 1 828 in SEQ ID NO:28 were used to 
obtain a fragment that included the HSZ coding region plus its 5' and 3' regulatory regions. 
5 This fragment was cloned into the commercially-available vector P TZ19R (Pharmac.a) 
digested with Xba I and Sma I, yielding plasmid pCC 10. 

It was desirable to create an altered form of the HSZ gene with a unique restncuon 
endonuclease site at the start of the mature protein, i .e., with the amino termmal s.gnal 
sequence removed. To accomplish this a DNA fragment was generated usmg PCR. 
1 0 Template DNA for the PCR reaction was plasmid pX8m. Oligonucleotide pnmers for the 
reaction were: 

CFC106: V-rrACT TCATGA CCCATATCCCAGGGCACTT-3' SEQ ID NO:36 
1 5 CFC88: V-TTCTA TCTAGA ATGCAGCACCAACAAAGGG-3' SEQ ID NO:37 

The CFC 106 (SEQ ID NO:36) oligonucleotide provided the PCR-generated fragment with a 
BspH I site (underlined), which when digested with BspH I results in a cohesive-end 
identical to that generated by an Nco I digest. This site was located at the junction of the 

20 signal sequence and the mature HSZ coding sequence. The CFC88 (SEQ ID NO:37) 

oligonucleotide provided the PCR-generated fragment with an Xba I site (underlined) at the 
translation terminus of the HSZ gene. The BspH I-Xba I fragment (SEQ ID NO:38) 
obtained by digestion of the PCR-generated fragment, encodes the mature form of HSZ with 
the addition of a methionine residue at the amino terminus of the protein to permit initiation 

25 of translation. 

EXAMPLE 8 
ronstniction of C himeric Genes for 
Fv prP^on of CS. AK J TT-M4 and HSZ ProtHns in the Seeds of Monocot Plants 
The following chimeric genes were made for transformation into monocot plants: 

30 

globulin 1 promo ter/mcts/lysC-M4/globul in 1 3' region 
globulin 1 promoter/corn CS coding region/globulin 1 3 1 region 
glutelin 2 promoter/mcts/lysC-M4/NOS 3' region 
glutelin 2 promoter/corn CS coding region/10 kD 3' region 
35 10 kD promoter/HSZ coding region/ 10 kD 3' region 

glutelin 2 promoter/HSZ coding region/10 kD 3' region 

A gene expression cassette employing the 10 kD zein regulatory sequences includes 
about 925 nucleotides upstream (5') from the translation initiation codon and about 945 
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nucleotides downstream (3 1 ) from the translation stop codon. The entire cassette is flanked 
by an EcoR I site at the 5' end and BamH I, Sal I and Hind III sites at the 3 1 end. The DNA 
sequence of these regulatory regions have been described in the literature [Kirihara et al,. 
Gene (1988) 71, 359-370] and DNA fragments carrying these regulatory sequences were 
5 obtained from corn genomic DNA via PCR. Between the 5' and 3' regions is a unique Nco I 
site, which includes the ATG translation initiation codon. The oligonucleotides CFC104 
(SEQ ID NO:34) and CFC105 (SEQ ID NO:35) (see Example 7) were inserted at the Xba I 
site near the 10 kD zein translation stop codon, thus adding a unique Sma I site. An 
Nco I-Sma I fragment containing the HSZ coding region was isolated from plasmid pCCIO 

1 0 (see Example 7) and inserted into Nco I-Sma I digested 1 0 kD zein expression cassette 
creating the chimeric gene: 1 0 kD promoter/HSZ coding region/1 0 kD 3' region. 

The glutelin 2 promoter was cloned from corn genomic DNA using PCR with 
primers based on the published sequence [Reina et al., Nucleic Acids Res., (1 990), 18, 
6426-6426]. The promoter fragment includes 1020 nucleotides upstream from the ATG 

1 5 translation start codon. An Nco I site was introduced via PCR at the ATG start site to allow 
for direct translation^ fusions. A BamH I site was introduced on the 5 1 end of the promoter. 
The 1.02 kb BamH I to Nco I promoter fragment was linked to an Nco I to Hind III fragment 
carrying the HSZ coding region/10 kD 3' region described above yielding the chimeric gene: 
glutelin 2 promoter/HSZ coding region/10 kD 3' region in a plasmid designated pML103. 

20 The globulin 1 promoter and 3' sequences were isolated from a Clontech corn 

genomic DNA library using oligonucleotide probes based on the published sequence of the 
globulin 1 gene [Kriz et al., Plant Physiol, (1989), 91, 636]. The cloned segment includes 
the promoter fragment extending 1078 nucleotides upstream from the ATG translation start 
codon, the entire globulin coding sequence including introns and the 3' sequence extending 

25 803 bases from the translation^ stop. To allow replacement of the globulin 1 coding 

sequence with other coding sequences an Nco I site was introduced at the ATG start codon, 
and Kpn I and Xba I sites were introduced following the translation^ stop codon via PCR to 
create vector pCC50. There is a second Nco I site within the globulin 1 promoter fragment. 
The globulin 1 gene cassette is flanked by Hind III sites. 

30 Plant amino acid biosynthetic enzymes are known to be localized in the chloroplasts 

and therefore are synthesized with a chloroplast targeting signal. Bacterial proteins such as 
AKIII have no such signal. A chloroplast transit sequence (cts) was therefore fused to the 
lvsC-M4 coding sequence in the chimeric genes described below. For corn the cts used was 
based on the the cts of the small subunit of ribulose 1,5-bisphosphate carboxylase from corn 

35 [Lebrun et al., Nucleic Acids Res., (1987), 75, 4360] and is designated mcts. The 

oligonucleotides SEQ ID NOS:40-45 were synthesized and used to attach the mcts to 
lvsC-M4. 
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Oligonucleotides SEQ ID NO:40 and SEQ ID NO:41 , which encode the carboxy 
terminal part of the corn chloroplast targeting signal, were annealed, resulting in Xba I and 
Nco I compatible ends, purified via polyacrylamide gel electrophoresis, and inserted into 
Xba I plus Nco I digested P BT492 (see Example 5). The insertion of the correct sequence 
5 was verified by DNA sequencing yielding pBT556. Oligonucleotides SEQ ID NO:42 and 
SEQ ID NO:43, which encode the middle part of the chloroplast targeting signal, were 
annealed, resulting in Bgl II and Xba I compatible ends, purified via polyacrylamide gel 
electrophoresis, and inserted into Bgl II and Xba I digested P BT556. The insertion of the 
correct sequence was verified by DNA sequencing yielding pBT557. Oligonucleotides SEQ 

10 ID NO:44 and SEQ ID NO:45, which encode the amino terminal part of the chloroplast 
targeting signal, were annealed, resulting in Nco I and Afl II compatible ends, purified via 
polyacrylamide gel electrophoresis, and inserted into Nco I and Afl II digested pBT557. The 
insertion of the correct sequence was verified by DNA sequencing yielding pBT558. Thus 
the mcts was fused to the lvsC -M4 gene. 

1 5 To construct the chimeric gene: globulin 1 promoter/mcts/]ysC-M4/globulin 1 3' 

region an Nco I to Hpa I fragment containing the mcts/iysC-M4 coding sequence was 
isolated from plasmid pBT558 and inserted into Nco I plus Sma I digested pCC50 creating 
plasmid pBT663. 

To construct the chimeric gene: glutelin 2 promoter/mcts/lysC-M4/NOS 3' region the 

20 1 .02 kb BamH I to Nco I glutelin 2 promoter fragment described above was linked to the 

Nco I to Hpa I fragment containing the mcts/iysC-M4 coding sequence described above and 
to a Sma I to Hind III fragment carrying the NOS 3' region creating. 

To construct the chimeric gene: globulin 1 promoter/corn CS coding region/globulin 
1 3' region a 1482 base pair BspH I fragment containing the corn CS coding region (see 

25 Example 4) was isolated and inserted into an Nco I partial digest of pCC50. A plasmid 
designated pML157 carried the CS coding region in the proper orientation to create the 
indicated chimeric gene, as determined via restriction endonuclease digests. 

To construct the chimeric gene: glutelin 2 promoter/corn CS coding region/10 kD 3' 
region the HSZ coding region was removed from pML103 (above) by digestion with Nco I 

30 and Xma I and insertion of an oligonucleotide adaptor containing an EcoR I site and Nco I 
and Xma I sticky ends. The resulting plasmid was digested with Nco I and the 1482 base 
pair BspH I fragment containing the corn CS coding region (see above and Example 4) was 
inserted. A plasmid with the CS coding region in the proper orientation, as determined via 
restriction endonuclease digests, was obtained, creating the indicated chimeric gene. 

35 A corn CS gene that contained the entire chloroplast targeting signal was constructed 

by fusing the 5 1 end of the genomic CS gene to the 3' end of the cDNA. A 697 bp Nco I to 
Sph I genomic DNA fragment replaced the analogous Nco I to Sph I fragment in the cDNA. 
Thus, the first 168 amino acids are encoded by the genomic CS sequence and the coding 
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sequence is interrupted by two introns. The remaining 341 amino acids are encoded by 
cDNA CS sequence with no further introns, resulting in a protein of 509 amino acids in 
length (SEQ ID NO:19). : A 1750 bp Nco I to BspH I DNA fragment that includes the entire 
CS coding region was inserted into the corn embryo and endosperm expression cassettes 
5 resulting in the chimeric genes globulin 1 promoter/corn CS coding region/globulin 1 3' 
region in plasmid pFSl 198 and glutelin 2 promoter/corn CS coding region/10 kD zem 3' 

region in plasmid pFSl 196, respectively. 

EXAMPLE 9 
Construction of Ch imeric Genes for 
10 Fv pression of CS AKTTT-M4. a p H HS7. Proteins in the Seeds of Dicot Plants 

The following chimeric genes were made for transformation into dicot plants: 

phaseolin promoter/scts/lysC-M4/phaseolin 3' region 
KTI3 promoter/scts/corn CS coding region/KTI3 3* region 
1 5 phaeolin promoter/HSZ coding region/phaseolin 3" region 

P_conglycinin-conglycinin promoter/HSZ coding region/phaseolin 3 1 region 

A first seed-specific expression cassette used for expression in dicotyledonous plants 
is composed of the promoter and transcription terminator from the gene encoding the b 

20 subunit of the seed storage protein phaseolin from the bean Phaseolus vulgaris [Doyle et al. 
(1986) J. Biol. Chem. 261:9228-9238]. The phaseolin cassette includes about 500 
nucleotides upstream (5«) from the translation initiation codon and about 1650 nucleotides 
downstream (3 1 ) from the translation stop codon of phaseolin. Between the 5' and 3' regions 
are the unique restriction endonuclease sites Nco I (which includes the ATG translation 

25 initiation codon), Sma I, Kpn I and Xba I. The entire cassette is flanked by Hind III sites. 

A second seed-specific expression cassette used for expression in dicotyledonous 
plants is composed of the promoter from the a' subunit of soybean [3-conglycinin (Glycine 
max) and the transcription terminator from the gene encoding the P subunit of the seed 
storage protein phaseolin from the bean Phaseolus vulgaris (above). The conglycinin 

30 cassette includes 607 nucleotides upstream (5 1 ) from the translation initiation codon of 

soybean (3-conglycinin and about 1650 nucleotides downstream (3') from the translation stop 
codon of phaseolin. Between the 5' and 3' regions are the unique restriction endonuclease 
sites Nco I (which includes the ATG translation initiation codon), Sma I, Kpn I and Xba I. 
The entire cassette is flanked by Hind III sites. 

35 A third seed-specific expression cassette used for expression in dicotyledonous plants 

is composed of the promoter and transcription terminator from the soybean Kunitz trysin 
inhibitor 3 (KTI3) gene [Jofuku et al., Plant Cell, (1989), 7. 427-435]. The KTI3 cassette 
includes about 2000 nucleotides upstream (5') from the translation initiation codon and about 
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240 nuc.eo.ides down*— (3') from .he translation s.op codon of phaseohn. Between the 
I and 3" regions are .he unique res.ric.ion endonudease si.es Nco . (which includes he ATO 
.rans,a.ion initiation codon), Xba 1, Kpn I and Sma .. The en.ire cassette ,s flanked by 

BamH Pl^ amino acid biosyn.he.ie enzymes are known .o be loca.ized in .he ch.oropias, 

and therefore are synthesized wi.h a eh.orop.as. targe.ing signal. ^"^^ 

AKIII have no such signa.. A eh.orop.as. transit sequence (cts) was therefore f^ . ** 

to&M coding sequence in some chimeric genes. The cts used « .based nthe he c of 

th7smal 1 subuni,ofribu,ose,>bisphos P ha,ecarboxy 1 asefrom^ 

J Mo, Appl G.-/, (1982). 7,483-498]. The oligonucleotides SEQ ID NOS/16-5 were 

symtesizS and used as described be,ow. The soybean c, (sets) was also used to replace the 

native corn cts in the corn CS gene. 

Oligonucleotides SEQ ID NO:46 and SEQ ID NO:47, wh.ch encode the -boxy 
terminal part of the chlorop.ast targeting signa,, were annea.ed, resulfin* 
ends, purified via po.yacry.amide ge. e.ectrophoresis, and mserted m o Nco 1 d,ge ed 
PBT461 . The insertion of the correct sequence in .he correc, orientation ™^ 
DNA sequencing yie.ding pBT496. O.igonuc.eo.ides SEQ ID NO:48 and SEQ .D NCM9. 
which encode the amino terminal par. of the chloroplas, targeting s.gnal, were annealed, 
resulting in Nco 1 compa.ib.e ends, purified via polyacrylamide gel deCrophoresis, and 
inserted into Nco I digested P BT496. The insertion of the correc. sequence m the correc 
orientation was venfied by DNA sequencing yielding pBT52, . Thus the sets was fused to 

* e ^ToTse the sets to the !ysC-M4 gene, pBT521 was digested with Sal I, and an 
approximately 900 bp DNA fragment that included the sets and the ^rino _T codmg 
region of fysC was isolated. This fragment was inserted into Sal I dtgested P BT492 
eltivelyiTplacing the amino terminal coding region of 1V.C-M4 wit the fused sets and 
.he amino terminal coding region of !ysC. Since the mutation that resulted m lysine- 
Lnsitivity was no. in the replaced fragmen, .he new plasmid, P BT523, earned the sets 

^ 'a^oTp Nco .-Hpa I fragmen, containing the cts fused to W.C-M4 plus about 
90 bp of 3- non-coding sequence was isolated from pBT523 and inserted into the phaseo m 
seed^pecifc expression cassette digested with Nco I and Sma ., yielding plasm,d pBT544 

carrying the chimeric gene: 

phaseolin P romoter/sc,s/!vsC-M4/phaseolin 3' region. 
Art sets DNA fragment that can be readily inserted in.o dicot gene «P«»°« 
cassettes was crea.ed. Employing PGR wi.h primers CF32 (SEQ ID NO.50) and CF SEQ 
IDNO-51) and any .empla.e DNA carrying the soybean cts, e.g. pBT523 above, results ,n a 
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DNA fragment carrying the entire sets. .This fragment is then cut with Nco I and hgated to 
any gene that carries an Nco I site in-frame with the translation initiation codon. 

The corn CS gene of plasmid pFS1088 (Example 3) was cut with restriction enzyme 
Sst II and the oligonucleotide adaptor shown in SEQ ID NO:52 was self-annealed and 
5 inserted. This removes most of the corn chloroplast transit peptide coding region and adds 
an Nco I site in-frame with the CS coding sequence. A DNA fragment containing the thus 
modified corn CS gene was was obtained by digestion with Nco I and BspH I and hgated 
into the KTI3 expression cassette digested with Nco I. Insertion of the corn CS gene in the 
proper orientation was determined by restriction enzyme mapping. The sets was then added 
10 as an Nco I fragment as described above yielding the chimeric gene: KTI3 
promoter/scts/corn CS coding region/KTI3 3' region. 

The Nco I-Xba I fragment containing the entire HSZ coding region (see Example 7) 
was isolated from an agarose gel following electrophoresis and inserted into the phaseol.n 
and p-conglycinin expression cassettes which had been digested with Nco I-Xba I. Thus the 

1 5 two chimeric genes: 

1) phaseolin 5' region/HSZ/phaseolin 3' region 

2) p-conglycinin 5' region/HSZ/phaseolin 3' region were created. 

F Y AMPLE 10 
Isolation of the F. coli ro <--tT, Gene and 
20 fWtniction of PhW.ric Genes for Expres sion in the Seeds ofPlants 

The metL gene of E coli encodes a Afunctional protein, AKII-HDHII; the AK and 
HDH activities of this enzyme are insensitive to all pathway end-products. The metL gene 
of E coli has been isolated and sequenced previously [Zakin et al., J- Biol. Chem., (1 983), 
258 3028-303 1]. For the present invention a DNA fragment containing the metL gene was 
25 isolated and modified from E. coli genomic DNA obtained from strain LE392 using PCR. 
The following PCR primers were designed and synthesized: 

CF23 5--GAAACCATGG CCAGTGTGAT TGCGCAGGCA-3' SEQ ID NO:53 
30 CF24 S'-GAAAGGTACC TTACAACAAC TGTGCCAGC-3' SEQIDNO:54 

These primers add an Nco I site which includes a translation initiation codon at the amino 
terminus of the AKII-HDHII protein. In order to add the restriction site and additional 
codon, GCC coding for alanine, was also added to the amino terminus of the protein, he 
3 5 primers also add a Kpn I site immediately following the translation stop codon. 

PCR was performed using a Perkin-Elmer Cetus kit according to the instructions of 
the vendor on a thermocycler manufactured by the same company. The primers were at a 
concentration of 10 mM and the thermocycling conditions were: 
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94° 1 min, 50° 2 min, 72° 8 min for 10 cycles followed by 
94° 1 min, 72° 8 min for 30 cycles. 

Reaction, with four different concentrations of template DNA a.l yielded the expected 2.4 kb 
DNA fragment, along with several other smaller fragments. The four PGR reason m.xes 
were pooled, digested with Nco 1 and Kpn I and the 2.4 kb fragments were pun fied and 
isolated from an agarose gel. The fragment was inserted into a modified pBT430 expression 
vector (see Example 2) containing a Kpn I site downstream of the Nco I site at the translation 
initiation codon. DNA was isolated from 8 clones carrying the 2.4 kb fragment „ .the 
PBT430 expression vector and transformed into the expression host stram BL21(DE3). 
Cultures were grown in TB medium containing ampicillin (1 00 mg/L) a. 37°C ovem.ght. 
The cells were collected by centrifuga.ion and resuspended in IQSth the original culture 
volume in 50 mM NaCl; 50 mM Tris-Cl, P H 7.5; 1 mM EDTA, and frozen at -20 C, thawed 
a. 37°C and sonicated, in an ice-water bath, to lyse the cells. The lysate was centnfuged at 
4»P for 5 min at 12,000 rpm. The supernatant was removed and the pellet was resuspended 
in the above buffer. The supernatant fractions were assayed for HDH enzyme activities to 
identify clones expressing functional proteins. HDH activity was assayed as shown below: 

HDH ASSAY 

Stock solutions LQM SJffiji! Finalconc 

0.2MKPO 4 ,pH7.0 500 ul 100ul 100 mM 

3.7MKCI 270 ul 54 ul 

~ on ill 4 ul 10 mM 

0.5 M EDTA 20 Ml " 

i n ..i 2 ul 10 mM 

1.0MMgCl 2 ^ Z ^ Kyr 

ir»n..i 90 ul 0.20 mM 

0 mM NADPH 100 ul zu ui 

M^ M fx7ure of above reagents with amounts ™f^™™^l™Zcr 
Use 0.9 mL of mix for lmL assay; 180 uL of mix for 0.2 mL assay in microliter 

dish 
Add 

l.OMASAinl.ONHCl lul 0.2^1 1-OmM 

to 1/2 the assay mix; remaining 1/2 lacks ASA to serve as blank 
enzyme extract 10-100 ul 2-20 ul 

H0 tol.OmL to 0.20 mL 

Add enzyme extract last to start reaction. Incubate at ~30°C; monitor NADPH 
oxidation at 340 nM. 1 unit oxidizes 1 umol NADPH/min at 30°C in the 1 mL reaction. 

Four of eight extracts showed HDH activity well above the control. These four were 
then assayed for AK activity. AK activity was assayed as shown below: 
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AK ASSAY 

Assay mix (for 12X1 .OmL or 48 X 0.25mL assays): 

2.5 mL H 2 0 

2.0 mL 4M KOH 

2.0 mL 4MNH 2 OH-HCl 

l.OmL lMTris-HCl pH 8.0 

0.5 mL 0.2M ATP (121 mg/ml in 0.2M NaOH) 

50 uL 1M MgS0 4 
pH of assay mix should be 7-8 

Each 1.5 mL eppendorf assay tube contains: 

MACRO assay micro assay 

assay mix «-64 ™ L 016 mL 

0.2M L-Aspartate 0.04 mL 0.01 mL 

5-120 uL 1-30 ul 

H 2 0 to total vol. 0.8 mL 0.2 mL 

Assay tubes are incubated at 30°C for 30-60 min 
Add to develop color; 

FeCl 3 reagent 0.4 mL 0.1 mL 

FeCl 3 reagent is: 1 0% w/v FeCl 3 50 g 

3.3% TCA 15.5 g 

0.7% HC1 35 mL HC1 

H 2 0 to 500 mL 

Spin for 2 min in eppendorf centrifuge tube. 
Read OD at 540 nm. 

Two extracts also had high levels of AK enzyme activity. These two extracts were 
then tested for inhibition of AK or HDH activity by the pathway end-products, lys, thr and 
met. Neither the AK nor the HDH activity of the extract from clone 5 was inhibited by 
30 mM concentrations of any of the end-products. 

The supernatant and pellet fractions of several of the extracts were also analyzed by 
SDS polyacrylamide gel electrophoresis. In the extract from clone 5, the major protein 
visible by Coomassie blue staining in both the pellet and supernatant fractions had a 
molecular weight of about 85 kd, the expected size for AKII-HDHII. The metL gene in 
plasmid pBT718 from clone 5 was used for all subsequent work. 

Plant amino acid biosynthetic enzymes are known to be localized in the chloroplasts 
and therefore are synthesized with a chloroplast targeting signal. Bacterial proteins have no 
such signal A chloroplast transit sequence (cts) was therefore fused to the metL coding 
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sequence in .he chimeric genes described below. For corn the c,s used was based on .he the 
ctsof the smail subuni. of ribulose 1 ,5-bisphospha.e carboxylase from corn [Lebrun e. al. 
(1987) Nucleic Acids Res. /5:4360] and is designated mcts. 

01igonuc.eo.ides SEQ ID NO:40 and SEQ ID N0:41. which encode the carboxy 
terminal part of the com chloroplas. targeting signal, were annealed, resu.tmg ,n Xba I and 
Nco 1 com P a.ib.e ends, purified via polyacrylamide gel electrophoresis, and mserted ,n,o 
Xba I plus Nco I digested pBT7 1 8. The insertion of the correct sequence was venfied by 
DNA sequencing yielding pBT725. To comp.ete the com chloroplas. taxgeung s.gna 
PBT725 was diges.ed with Bgl II and Xba I, and a 1.14 kb BamH I to Xba I fragmen. from 
pBT580 containing the glutelin 2 promoter plus the amino terminal part of the corn 
chloroplast targeting signal was inserted creating pBT726. 

To construct the chimeric gene: 

globulin 1 promoter/mcts/metL/globulin 1 3' region 
(he 2 6 kb Nco I to Kpn I fragment containing the mcts/metL coding sequence was .sola.ed 
from p.asmid P BT726 and inserted in.o Nco I plus Kpn I digested pCCSO crea.mg plasmtd 

pBT727. 

To construct the chimeric gene: 

glutelin 2 promoter/mcts/rnetL/NOS 3' region 
the 2 6 kb Nco I to Kpn I fragment containing the mcts/metL coding sequence was .solated 
from plasmid P BT726 and linked to the 1 .02 kb BamH I to Nco I glutelin 2 promoter 
fragmen, described in Example 8 and to a Kpn I to Hind III fragment carrymg the NOS 
region creating plasmid pBT728. 

To construct the chimeric gene: 

phaseolin promoter/scts/metL/phaseolin 3' region 
the 2 4 kb Nco I to Kpn I fragment containing the metL coding sequence was isolated from 
plasmid P BT718 and inserted into Nco I plus Kpn I digested phaseolin express^ cassette. 
The sets was then added as an Nco I fragment as described in Example 8. 

EXAMPLE 1 1 

Co.^truction of rhimeric MS Genes for Expression in the Seeds of Plants 
The following chimeric genes were made for transformation into monocot plants: 
globulin 1 promoter/tobacco MS coding region/globulin 1 3' region; 
glutelin 2 promoter/tobacco MS coding region/NOS 3' region. 
To construct the chimeric gene: , 
globulin 1 promoter/tobacco MS coding region/globulin 1 3 re gl on, the 2300 bp 
BspH I-Kpn I fragment.containing the tobacco MS coding sequence was .solated 
' ' from plasmid P BT773 and inserted into Nco I plus Kpn I digested pCCSO 
(Example 8). 
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To construct the chimeric gene: , , 

g ute.in 2 promoter/tobacco MS coding region/NOS 3' reg.on, the 2300 bp 
Bs 'h I-Kpn I fragment containing the tobaceo MS ending sequence w s .solated 
, H RT773 and linked to the 1 .02 kb BamH I to Neo 1 gluteim 2 

carrying the NDS 3' region. 
The following chimeric gene was made for transformation mto dice* plants. 
KTI3 promoter/tobacco MS coding region/KTI3 3' region. 

To construct the chimeric gene: vr»n T 

,. /vtti i' rpainn the 2300 bp BspH I-Kpn 1 

KTI3 promoter/tobacco MS coding region/KTI3 3 region, the z^u P y 

fr I n, contaming the tobaeeo MS coding science was isolated ton, »BT773 
and inserted into the Neo I plus Kpn . digested KT,3 express.on cassette desenbed 
Example 9. 

FY AMPLE 12 

~, • *i_-r* i THinn-.it the A ifti'"' t Y <"vf Methion ine Synthase 

enzymes, p ^Xr-vn maltose binding protein, and C- and/or 

o transferase ("GST"), thioredoxin ( Trx ), maltose Dinamg p 

S-transterase l ua h f - Af , r <m\^ The fusion proteins may be engineered 

digestion 

thrombin, enterokinase and factor Xa. However, any proi 
cleaves the peptide connecting the fusion protein and the enzyme. 

PuriLation of the instant plant methionine synthases, if des.red, may ut.l.ze any 
number of s aration technologies fanuliar to those skilled in the art of pro.em pnn .eat.on. 
Wes of nch methods inclnde, bu, are no. limited to, homogen.zat.on, filtrahon, 

Lrifugat.on heat ^^^^^^^^ "* 

spec! for the enzyme. For example, the instant plan, methiomne synthase may be 
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expressed as a fusion protein coupled to .he C-.erminus of thioredoxm. In addmon, a (H.s) 6 
Z de may be engineered into the N-.erminus of the fused thioredoxm mo.e.y to afford 
a Z Lai opportunities for affinity purification. Other suitab.e affiru.y res.ns could be 

hesized by linking the appropriate ligands to any suitable resin such as Sepharo,c-4B. .n 
an alternate embodiment, a thioredoxin fusion protein may be elu.ed us.ng ^h'othre.to. 
Lever, elufion may be accomp.ished using other reagent which interact to d,p ace the 
thioredoxin from the resin. These reagents include p-mercap.oethanol or other reduced 
Z L e.u.ed fusion pro.ein may be subjected to fiuther purification by means 
as stated above, if desired. Proteolytic cleavage of the thioredoxm fusmn P'~ d 
Lyme may be accomp.ished after the fusion pro.ein is purified or wh.le ,he profe.n .s s„„ 
bound to the ThioBond™ affinity resin or other res.n. 

Crude, partially purified or purified enzyme, either alone or as a fus.on pro.e.n, may 
be utilized in assays for .he evaluation of compounds for their ability to inh.b,, enzymafic 
ac.iva.ion of .he instant plant methionine synthase disclosed herein. Assays may be 
conducted under well known experimental conditions which permh t opt.mai enzyma ,c 
activity. For example, assays for methionine synthase are presented by E.chel etal. (1995) 
EurJBiochem 250:1053-1058. 
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